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NUCLEIC ACID MEDIATED DISRUPTION OF HIV FUSOGENIC PEPTIDE 

INTERACTIONS 

This application is a continuation-in-part of PCT/US03/05190 filed February 20, 
5 2003, which claims the benefit of McSwiggen et al 9 U.S. Provisional Application Serial 
No. 60/398,036, filed July 23, 2002, of McSwiggen U.S. Provisional Application Serial 
No. 60/374,722, filed April 22, 2002, of Beigelman U.S. Provisional Application Serial 
No. 60/358,580, filed February 20, 2002, of Beigelman U.S. Provisional Application 
Serial No. 60/363,124, filed March 11, 2002, of Beigelman U.S. Provisional Application 

10 Serial No. 60/386,782, filed June 6, 2002, of Beigelman U.S. Provisional Application 
Serial No. 60/406,784, filed August 29, 2002, of Beigelman U.S. Provisional Application 
Serial No. 60/408,378, filed September 5, 2002, of Beigelman U.S. Provisional 
Application Serial No. 60/409,293, filed September 9, 2002, and of Beigelman U.S. 
Provisional Application Serial No. 60/440,129, filed January 15, 2003 and which is a 

15 continuation-in-part of McSwiggen et ai, U.S. Patent Application Serial No. 10/225,023, 
filed August 21, 2002, which is a continuation-in-part of McSwiggen et al, U.S. Patent 
Application Serial No. 10/157,580, filed May 29, 2002, which claims the benefit of 
McSwiggen U.S. Provisional Application Serial No. 60/294,140, filed May 29, 2001. 
These applications are hereby incorporated by reference herein in their entireties, 

20 including the drawings. 

BACKGROUND OF THE INVENTION 

The present invention concerns compounds, compositions, and methods for the 
study, diagnosis, and treatment of degenerative and disease states related to Human 
hnmunodeficiency Virus (HIV) infection and/or acquired immunodeficiency syndrome 
25 (AIDS). Specifically, the invention relates to nucleic acid molecules used to inhibit HIV 
cell fusion and entry via disruption of fusogenic peptide interactions. 

Human immunodeficiency virus type I (HIV-1) enters permissive cells by binding 
to the cellular receptor, CD4, followed by fusion of the viral and target cell membranes. 
Fusion results in viral entry into the target cell followed by integration and expression of 
30 the HIV-1 genome. The HIV-1 envelope glycoprotein mediates the fusion process 
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through interaction with cellular receptors. The HIV-1 envelope glycoprotein is 
synthesized as a precursor protein, gpl60, which is proteolytically processed to generate 
two subunits, the surface glycoprotein gpl20 and the transmembrane glycoprotein gp41. 
These subunits remain noncovalently associated to form the oligomeric envelope 

5 glycoprotein spike on the viral membrane. Portions of gpl20 bind to the CD4 receptor 
and a chemokine receptor (typically CCR5 or CXCR4) on the surface of target cells. 
These events trigger gp41 to undergo conformational changes that promote fusion of viral 
and cellular membranes, resulting in entry of the viral core into the cell. By analogy with 
the pH-induced structural changes in the hemagglutinin (HA) protein of influenza virus, 

10 the HIV-1 fusion activation process likely involves substantial conformational changes 
from a pre-fusogenic state to a fusogenic conformation. 

The structure of the ectodomain of both HIV and SIV gp41 in the fosogenic/post- 
fusogenic state has been characterized by NMR and crystallography. These studies have 
shown that gp41 consists of a trimer of hairpins. In the fusogenic conformation of gp41, 

1 5 three N-terminal helices form a trimeric coiled coil, and three C-terminal helices pack in 
the reverse direction into three hydrophobic grooves on the surface of the coiled coil, 
bringing the amino and carboxy termini of the ectodomain together. Because the 
membrane anchor and the fusion peptide of the gp41 ectodomain are embedded in the 
viral and target cell membranes respectively, the formation of a fusogenic hairpin 

20 structure results in the colocalization of the two membranes. Peptides corresponding to 
the C-terminal region, referred to as C peptides, can specifically inhibit viral entry into 
target cells at nanomolar concentrations. One such peptide (T-20) is in clinical study and 
has shown antiviral activity in humans. T-20 binds to gp41 only after interaction of the 
envelope glycoprotein complex with the cellular receptors. 

25 Jeffs et al, International PCT Publication No. WO 01/51673, describes isolated 

portions of gp41 protein (DP107 and DP178 domains) that are used to inhibit interaction 
between gp41 and gp 120 and prevent infectivity of HTV. 

Soukchareun et al, 1998, Bioconjugate Chemistry, 9, 466-475, describes the use of 
N-Fmoc-cysteine(S-thiobutyl) derivatized oligodeoxynucleotides for the preparation of 

30 certain gp41 peptide hybrid oligonucleotides having membranotropic activities. 
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SUMMARY OF THE INVENTION 

The present invention relates to nucleic acid molecules used to inhibit HIV cell 
fusion and entry via disruption of fusogenic peptide interactions. The invention also 
relates to nucleic acid molecules directed to disrupt the function of the HIV-1 envelope 

5 glycoprotein, such as to inhibit CD4 receptor mediated fusion of HIV-1 . In particular, the 
present invention describes the selection and function of nucleic acid molecules, such as 
aptamers, capable of specifically binding to the HIV-1 envelope glycoprotein and 
modulating activity of the HIV-1 envelope glycoprotein or components thereof. These 
nucleic acid molecules can be used to treat diseases and disorders associated with HTV 

1 0 infection, or as a prophylactic measure to prevent HTV- 1 infection. 

The nucleic acid aptamers of the invention can be used as antifusogenic and 
antiviral agents. The antifusogenic activity of the aptamers of the invention can result 
from the ability to modulate intracellular processes that involve coiled-coil peptide 
structures or protein-protein interactions. The antiviral activity of the aptamers of the 
15 invention includes but is not limited to the inhibition of HTV transmission to uninfected 

CD-4+ cells. 

The present invention also features the use of one or more nucleic acid-based 
techniques for modulating gene expression, such as nucleic acid aptamers, enzymatic 
nucleic acid molecules, small interfering RNA (siRNA), nucleic acid sensor molecules, 

20 allozymes, antisense nucleic acid molecules, 2,5-A nucleic acid chimeras, triplex 
oligonucleotides, and antisense nucleic acid molecules with nucleic acid cleaving groups, 
to modulate the activity, expression, or level of cellular proteins required for HTV cell 
fusion and entry. For example, the invention features the use of nucleic acid-based 
techniques to specifically modulate the activity and/or expression of proteins required for 

25 HTV cell fusion and entry. 

In one embodiment, the invention features antifusogenic nucleic acid aptamers 
directed to disrupt the function of HTV-1 envelope glycoprotein or components thereof 
and prevent viral membrane fusion and/or entry. The nucleic acid aptamers of the 
invention are designed to interact with subunits of the HTV-1 envelope glycoprotein, such 
30 as the gpl20 and gp41 subunits of the HTV-l envelope glycoprotein, and disrupt the 
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function of the HIV-1 envelope glycoprotein or components thereof. Such disruption of 
the HIV-1 envelope glycoprotein can be effected, for example, by preventing 
conformational changes to gpl20 or gp41, and/or preventing protein-protein interactions 
between gpl20 and/or gp41 or interactions within gpl20 and/or gp41. 

5 In another embodiment, the invention features antifusogenic nucleic acid aptamers 

having binding affinity to gp41 . Non-limiting examples of target regions within the gp41 
peptide sequence include sequences derived from the C-terminal region of gp41. For 
example, the present invention features aptamers having binding affinity to a peptide 
sequence corresponding to amino acids 638 to 673 of GP-41, and aptamers having 

10 binding affinity to a peptide sequence corresponding to amino acids 558 to 595 of GP-41 
(see for example Jeffs et al, US Patent Application No. 09/350,841, incorporated by 
reference herein in its entirety including the drawings). 

In yet another embodiment, the invention features antifusogenic nucleic acid 
aptamers having binding affinity to peptide sequences having SEQ ID No. 1233 and/or 
15 SEQ ID No. 1234 (Table XII) or functional equivalents thereof. For example, in certain 
embodiments, nucleic acid aptamers of the invention can have binding affinity to analogs 
of the peptides contemplated herein, such analogs can contain one or more amino acid 
truncations, deletions, insertions or substitutions. 

In one embodiment, the invention features an antifusogenic nucleic acid aptamer 
20 that specifically binds the HIV-1 envelope glycoprotein. In one embodiment, the 
invention features a nucleic acid aptamer that specifically binds the gp41 region of the 
HIV-1 envelope glycoprotein. In another embodiment, the invention features a nucleic 
acid aptamer molecule that specifically binds to the gpl20 region of the HIV-1 envelope 
glycoprotein. 

25 In one embodiment, nucleic acid aptamers of the invention act extracellularly and 

bind to their HIV-1 envelope glycoprotein targets outside of cells. Theses nucleic acid 
aptamers provide an attractive approach to treating HIV infection because they are able to 
act outside of cells or extracellularly. 

In another embodiment, the invention features a composition comprising a nucleic 
30 acid aptamer of the invention in a pharmaceutically acceptable carrier, hi another 
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embodiment, the invention features a mammalian cell, for example a human cell, 
comprising a nucleic acid aptamer contemplated by the invention. 

In one embodiment, the invention features a method for treatment of HIV-1 
infection and/or AIDS, comprising administering to a patient a nucleic acid aptamer of 
5 the invention under conditions suitable for the treatment. 

In another embodiment, the invention features a method of treatment of a patient 
having a condition associated with HIV-1 infection, comprising contacting cells of said 
patient with a nucleic acid aptamer of the invention under conditions suitable for such 
treatment. In another embodiment, the invention features a method of treatment of a 

10 patient having a condition associated with HIV-1 infection, comprising contacting cells of 
said patient with a nucleic acid aptamer of the invention, and further comprising the use 
of one or more drug therapies under conditions suitable for said treatment. Examples of 
suitable drug therapies include reverse transcriptase inhibitors such as zidovudine (AZT), 
zalcitabine (DDC), zidovudine (ZDV), lamivudine (3TC), didanosinedelavirdine (DDI), 

15 stavudine (D4T), abacavir, efavirenz, nevirapine, or tenofovir disoproxil fumarate, 
ribavirin and/or protease inhibitors such as indinavir, amprenavir, saquinavir, lopinavir, 
ritonavir, or nelfmavir, or any combination thereof. In another embodiment, the other 
therapy is administered simultaneously with or separately from the nucleic acid molecule. 

In another embodiment, the invention features a method for modulating HIV cell 
20 fusion in a mammalian cell comprising administering to the cell a nucleic acid molecule 
of the invention under conditions suitable for the modulation. 

In yet another embodiment, the invention features a method of modulating HIV cell 
fusion, comprising contacting a nucleic acid aptamer of the invention with HIV-1 
envelope glycoprotein, gpl20 and/or gp41 under conditions suitable for the modulating of 
25 the HIV cell fusion activity. 

In one embodiment, a nucleic acid molecule of the invention, for example an 
aptamer or enzymatic nucleic acid molecule, is chemically synthesized. In another 
embodiment, the nucleic acid molecule of the invention comprises at least one nucleic 
acid sugar modification. In yet another embodiment, the nucleic acid molecule of the 
30 invention comprises at least one nucleic acid base modification. In another embodiment, 
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the nucleic acid molecule of the invention comprises at least one nucleic acid backbone 
modification. 

In one embodiment, the nucleic acid molecule of the invention comprises one or 
more ribonucleotides. In another embodiment, the nucleic acid molecule of the invention 
5 comprises one or more deoxy ribonucleotides. 

In another embodiment, the nucleic acid molecule of the invention comprises at 
least one 2'-0-alkyl, 2'-alkyl, 2'-alkoxylalkyl, 2'-alkylthioalkyl, 2^amino, 2'-0-amino, 
or 2'-halo modification and/or any combination thereof with or without 2'-deoxy and/or 
2'-ribo nucleotides. In yet another embodiment, the nucleic acid molecule of the 
10 invention comprises all 2'-0-alkyl nucleotides, for example, all 2'-0-allyl nucleotides. 

In one embodiment, the nucleic acid molecule of the invention comprises a 5'-cap, 
3'-cap, or 5'-3' cap structure, for example, an abasic or inverted abasic moiety. 

In another embodiment, the nucleic acid molecule of the invention is a linear 
nucleic acid molecule. In another embodiment, the nucleic acid molecule of the invention 
15 is a linear nucleic acid molecule that can optionally form a hairpin, loop, stem-loop, or 
other secondary structure. In yet another embodiment, the nucleic acid molecule of the 
invention is a circular nucleic acid molecule. 

In one embodiment, the nucleic acid molecule of the invention is a single stranded 
oligonucleotide. In another embodiment, the nucleic acid molecule of the invention is a 
20 double-stranded oligonucleotide. 

In one embodiment, the nucleic acid molecule of the invention comprises an 
oligonucleotide having about 3 to about 500 nucleotides. In another embodiment, the 
nucleic acid molecule of the invention comprises an oligonucleotide having about 3 to 
about 24 nucleotides. In another embodiment, the nucleic acid molecule of the invention 
25 comprises an oligonucleotide having about 4 to about 16 nucleotides. 

In one embodiment, the nucleic acid aptamer of the invention binds to its 
corresponding fflV-1 envelope derived target, with a binding affinity of about 100 pM- 
100 nM or about 20 to 50 nM, for example, by non-covalent interaction of the nucleic 
acid aptamer with a gp41 or gpl20 derived peptide sequence, secondary or tertiary 
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structure. In another embodiment, the nucleic acid aptamer of the invention binds to the 
fflV-1 envelope glycoprotein target with a binding affinity of less than about 20 nM. 

In another embodiment, the nucleic acid aptamer of the invention binds irreversibly 
to the HTV-1 envelope derived target, for example, by covalent attachment of the nucleic 
5 acid aptamer to gp41 or gpl20, or a gp41 or gpl20 derived peptide sequence, secondary 
or tertiary structure. The covalent attachment can be accomplished by introducing 
chemical modifications into the nucleic acid aptamer's sequence that are capable of 
forming covalent bonds to the HTV-1 envelope glycoprotein target sequence. 

In one embodiment, the invention features a composition comprising at least one 
10 HIV reverse transcriptase inhibitor and a nucleic acid molecule of the invention in a 
pharmaceutically acceptable carrier. In another embodiment, the invention features a 
composition comprising at least one HTV protease inhibitor and a nucleic acid molecule 
of the invention in a pharmaceutically acceptable carrier. In yet another embodiment, the 
invention features a composition comprising at least one HIV reverse transcriptase 
15 inhibitor, at least one HW protease inhibitor and a nucleic acid molecule of the invention 
in a pharmaceutically acceptable carrier. 

In another embodiment, the invention features a method of administering to a cell, 
for example a mammalian cell or human cell, a nucleic acid molecule of the invention 
independently or in conjunction with other therapeutic compounds such as HTV reverse 
20 transcriptase inhibitors and/or HIV protease inhibitors, comprising contacting the cell 
with the nucleic acid molecule and the HIV reverse transcriptase inhibitors and/or HIV 
protease inhibitors under conditions suitable for the administration. 

In yet another embodiment, the invention features a method of administering to a 
cell, for example, a mammalian cell or human cell, a nucleic acid molecule of the 
25 invention independently or in conjunction with other therapeutic compounds, such as 
enzymatic nucleic acid molecules, antisense molecules, triplex forming oligonucleotides, 
2,5-A chimeras, and/or RNAi molecules, comprising contacting the cell with the nucleic 
acid molecule of the invention under conditions suitable for the administration. 
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In another embodiment, administration of a nucleic acid molecule of the invention 
is administered to a cell or patient in the presence of a delivery reagent, for example a 
lipid, cationic lipid, phospholipid, or liposome. 

In one embodiment, the invention features a method for identifying nucleic acid 
5 aptamers having HIV anti-fusogenic properties comprising: (a) generating a randomized 
pool of oligonucleotides; (b) combining the oligonucleotides from (a) with gp41 in vitro 
under conditions suitable to allow at least one oligonucleotide to bind to the target gp41 
peptide; (c) removing non-bound oligonucleotide sequences from (b) under conditions 
suitable for isolating oligonucleotide sequences from (b) that possess binding affinity to 
10 gp41 by removing non-bound oligonucleotide sequences; (d) amplifying the 
oligonucleotide sequences isolated from (c) under conditions suitable for introducing 
some degree of mutation into the sequences; and (e) repeating steps (c) and (d) under 
conditions suitable for isolating one or more nucleic acid aptamers having binding affinity 
to gp41. 

1 5 In another embodiment, the invention features a method for identifying nucleic acid 

aptamers having HIV anti-fusogenic properties comprising: (a) generating a randomized 
pool of oligonucleotides; (b) combining the oligonucleotides from (a) with gpl20 in vitro 
under conditions suitable to allow at least one oligonucleotide to bind to the target gpl20 
peptide; (c) isolating oligonucleotide sequences from (b) that possess binding affinity to 
20 gpl20 by removing non-bound oligonucleotide sequences; (d) amplifying the 
oligonucleotide sequences isolated from (c) under conditions suitable for introducing 
some degree of mutation into the sequences; and (e) repeating steps (c) and (d) under 
conditions suitable for isolating one or more nucleic acid aptamers having binding affinity 
to gp210. 

25 In one embodiment, the invention features a method for identifying nucleic acid 

aptamers having HIV anti-fusogenic properties comprising: (a) generating a randomized 
pool of oligonucleotides; (b) combining the oligonucleotides from (a) with a target 
peptide derived from the HIV envelope glycoprotein in vitro under conditions suitable to 
allow at least one oligonucleotide to bind to the target peptide; (c) isolating 

30 oligonucleotide sequences from (b) that possess binding affinity to the target peptide by 
removing non-bound oligonucleotide sequences; (d) amplifying the oligonucleotide 
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sequences isolated from (c) under conditions suitable for introducing some degree of 
mutation into the sequences; and (e) repeating steps (c) and (d) under conditions suitable 
for isolating one or more nucleic acid aptamers having binding affinity to the target 
peptide. In the described methods, the random pool of oligonucleotides can comprise 
5 DNA and/or RNA, with or without chemically modified nucleotides. When chemically 
modified nucleotides are used in the method, such modifications can be chosen such that 
a non-discriminatory polymerase will incorporate the chemically modified nucleotide into 
the oligonucleotide sequence when generated or amplified. Non-limiting examples of 
chemically modified nucleoside triphosphates (NTPs) that can be used in the method of 
10 the invention include 2'-deoxy-2'-fluoro, 2'-deoxy-2'-amino, 2'-0-alkyl, and 2'-0- 
methyl NTPs as well as various base modified NTPs, such as C5-modified pyrimidines, 
2,6-diaminopurine, and inosine. The oligonucleotides used in the method can be of fixed 
or variable length. The target peptide derived from HIV envelope glycoprotein used in 
the method of the invention can comprise a synthetic or naturally occurring peptide that is 
15 synthesized or isolated from viral protein, for example by proteolytic cleavage. The 
target peptide can comprise sequence derived from proteins having sequence identical or 
similar to GenBank Accession Nos. AAM09869-AAM09880 or analogs thereof. For 
example, the target peptide can comprise sequences derived from gp41 or gpl20 that are 
essential for HTV membrane fusion and viral entry activity, such as SEQ ID NOs. 1233 
20 and/or 1234, and analogs thereof. These analogs can contain one or more amino acid 
truncations, deletions, insertions or substitutions. The conditions used in the method 
preferably provide nucleic acid aptamers that bind to their respective target in the 
conformation that the target adopts in its natural state. For example, peptide targets and 
binding conditions are chosen such that the isolated aptamer binds to its target site within 
25 the HTV envelope glycoprotein such that fusogenic activity of the protein is disrupted, 
such as by preventing intermolecular or intramolecular protein-protein interactions. The 
nucleic acid aptamers thus isolated by methods of the invention can be tested, for 
example, for an ability to inhibit cell fusion or viral activity using assays described herein. 

In another embodiment, the method for identifying nucleic acid aptamers having 
30 HIV anti-fusogenic properties comprises attaching the target protein or peptide sequence 
to a solid matrix, such as beads, microliter plate wells, membranes, or chip surfaces. In 
such a system, the target protein/peptide can be attached to the solid matrix either 
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covalently or non-covalently. In yet another embodiment, the oligonucleotide or nucleic 
acid aptamer used in a method of the invention can be labeled, either directly or non- 
directly, for example with a radioactive label, absorption label such as biotin, or a 
fluorescent label such as fluorescein or rhodamine. 

5 In one embodiment, the invention features novel nucleic acid-based techniques such 

as nucleic acid aptamers, used alone or in combination with enzymatic nucleic acid 
molecules, antisense molecules, and/or RNAi molecules, and methods for use to prevent 
HTV cellular fusion and entry or to down regulate or modulate the expression of HIV 
RNA and/or replication of HIV. 

10 In another embodiment, the invention features the use of one or more nucleic acid- 

based techniques, such as nucleic acid aptamers, enzymatic nucleic acid molecules, small 
interfering RNA (siRNA), nucleic acid sensor molecules, allozymes, antisense nucleic 
acid molecules, 2,5-A nucleic acid chimeras, triplex oligonucleotides, and antisense 
nucleic acid molecules with nucleic acid cleaving groups, to modulate the activity, 

15 expression, or level of cellular proteins required for HTV cell fusion and entry. For 
example, the invention features the use of nucleic acid-based techniques to specifically 
modulate the activity and/or expression of proteins required for HIV cell fusion and entry, 
-such as cellular receptors, cell surface molecules, cellular enzymes, cellular transcription 
factors, and/or cytokines, second messengers, and cellular accessory molecules. 

20 Examples of such cellular receptors involved in HIV infection contemplated by the 

instant invention include, but are not limited to, CD4 receptors, CXCR4 (also known as 
Fusin; LESTR; NPY3R, e.g., Genbank Accession No. NM 003467); CCR5 (also known 
as CKR-5, CMKRB5, e.g., Genbank Accession No. NM.000579); CCR3 (also known as 
CC-CKR-3, CKR-3, CMKBR3, e.g., Genbank Accession No. NM_001837); CCR2 (also 

25 known as CCR2b, CMKBR2, e.g., Genbank Accession Nos. NM 000647 and 
NM 000648); CCR1 (also known as CKR1, CMKBR1, e.g., Genbank Accession No. 
NM_001295); CCR4 (also known as CKR-4, e.g., Genbank Accession No. NM_005508); 
CCR8 (also known as ChemRl, TER1, CMKBR8, e.g., Genbank Accession No. 
NM.005201); CCR9 (also known as D6, e.g. Genbank Accession Nos. NM_006641 and 
30 NM_031200); CXCR2 (also known as IL-8RB, e.g., Genbank Accession No. 
NM 001557); STRL33 (also known as Bonzo; TYMSTR, e.g., Genbank Accession No. 
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NM_006564); US28; V28 (also known as CMKBRL1, CX3CR1, GPR13, e.g., Genbank 
Accession No. NM 001337); gprl (also known as GPR1, e.g., Genbank Accession No. 
NM 005279); gprlS (also known as BOB, GPR15, e.g., Genbank Accession No. 
NM.005290); Apj (also known as angiotensin-receptor-like, AGTRL1, e.g., Genbank 
Accession No. NM_005161); and ChemR23 receptors (e.g., Genbank Accession No. 
NM_004072). 

Examples of cell surface molecules involved in HIV infection contemplated by the 
instant invention include, but are not limited to, Heparan Sulfate Proteoglycans, HSPG2 
(e.g., Genbank Accession No. NM 005529); SDC2 (e.g., Genbank Accession Nos. 
AK025488, J04621, J04621); SDC4 (e.g., Genbank Accession No. NM 002999); GPC1 
(eg Genbank Accession No. NM_002081); SDC3 (e.g., Genbank Accession No. 
NM 014654); SDC1 (e.g., Genbank Accession No. NM_002997); Galactoceramides 
(e.g., Genbank Accession Nos. NM_000153, NM.003360, NM 001478.2, NM.004775, 
and NM_004861); and Erythrocyte-expressed Glycolipids (e.g., Genbank Accession Nos. 
NM_003778, NM_003779, NM_003780, NM_030587, and NM_001497). 

Examples of cellular enzymes involved in HIV infection contemplated by the 
invention include, but are not limited to, N-myristoyltransferase (NMT1, e.g., Genbank 
Accession No. NM_021079 and NMT2, e.g., Genbank Accession No. NM 004808); 
Glycosylate Enzymes (e.g., Genbank Accession Nos. NM.000303, NM_013339, 
NM_003358, NM_005787, NM_002408, NM_002676, NM_002435), NM 002409, 
NMJJ06122, NM_002372, NM_006699, NM_005907, NM_004479, NM_000150, 
NM_005216 and NM_005668); gp-160 Processing Enzymes (such as PCSK5, e.g., 
Genbank Accession No. NM 006200); Ribonucleotide Reductase (e.g., Genbank 
Accession Nos. NM.001034, NM_001033, AB036063, AB036063, AB036532, 
AK001965, AK001965, AK023605, AL137348, and AL137348); and Polyamine 
Biosynthesis enzymes (e.g., Genbank Accession Nos. NM_002539, NM 003132 and 
NM_001634). 

Examples of cellular transcription factors involved in HIV infection contemplated 
by the invention include, but are not limited to, SP-1 and NF-kappa B (such as NFKB2, 
) e.g., Genbank Accession No. NM_002502; RELA, e.g., Genbank Accession No. 
NM_021975; and NFKB1, e.g., Genbank Accession No. NM_003998). 
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Examples of cytokines and second messengers involved in HIV infection 
contemplated by the invention include, but are not limited to, Tumor Necrosis Factor-a 
(TNF-a, e.g., Genbank Accession No. NM_000594); Interleukin la (IL-la, e.g., Genbank 
Accession No. NM 000575); Interleukin 6 (IL-6, e.g., Genbank Accession No. 
5 NM 000600); Phospholipase C (PLC,e.g., Genbank Accession No. NM 000933); and 
Protein Kinase C (PKQe.g., Genbank Accession No. NM_006255). 

Examples of cellular accessory molecules involved in HIV infection contemplated 
by the invention include, but are not limited to, Cyclophilins, (such as PPID, e.g., 
Genbank Accession No. NM_005038; PPIA, e.g., Genbank Accession No. NM_021130; 
10 PPIE, e.g., Genbank Accession No. NM 006112; PPIB, e.g., Genbank Accession No. 
NM.000942; PPIF, e.g., Genbank Accession No. NM 005729; PPIG, e.g., Genbank 
Accession No. NM_004792; and PPIC, e.g., Genbank Accession No. NM_000943); 
Mitogen Activated Protein Kinase (MAP-Kinase, such as MAPKl, e.g., Genbank 
Accession Nos. NM_002745 and NM_138957); and Extracellular Signal-Regulated 
15 Kinase (ERK-Kinase). In one embodiment, nucleic acid molecules of the invention are 
used to treat HIV-infected cells or a HIV-infected patient wherein the HIV is resistant or 
the patient does not respond to treatment with current antiviral therapeutics such as HW 
reverse transcriptase or HTV protease inhibitors, either alone or in combination with other 
therapies under conditions suitable for the treatment. 

20 The present invention also features nucleic acid molecules capable of modulating 

gene expression, such as enzymatic nucleic acid molecules, small interfering RNA 
(siRNA), nucleic acid sensor molecules, allozymes, antisense nucleic acid molecules, 2,5- 
A nucleic acid chimeras, triplex oligonucleotides, and antisense nucleic acid molecules 
with nucleic acid cleaving groups, which down regulate expression of a sequence 

25 encoding a human immunodeficiency virus (such as HTV-1, HIV-2, and related viruses 
such as FIV-1 and SIV-1) envelope glycoprotein gene (env), for example Genbank 
accession number NC_001802 and/or sequences referred to in Table I. The sequence 
descriptions in Table I refer to composite names consisting of the following four parts: 
(a) HIV subtype (A, B, C, etc.); (b) Country of origin (US, JP, etc.); (c) Sampling year (2 
30 digits, a "-" means the sampling year isn't entered); and (d) Sequence name or isolate 



name. 
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The present invention features an enzymatic nucleic acid molecule comprising SEQ 
ID NOs. 505-905. The invention also features an enzymatic nucleic acid molecule 
comprising at least one binding arm wherein one or more of said binding arms comprises 
a sequence complementary to any of SEQ ID NOs. 1-395. 

In one embodiment, an enzymatic nucleic acid molecule of the invention is adapted 
to HTV infection or acquired immunodeficiency syndrome (AIDS). 

In another embodiment, the enzymatic nucleic acid molecule of the invention has 
an endonuclease activity to cleave RNA having HIV env sequence. 

In one embodiment, the enzymatic nucleic acid molecule of the invention is in an 
Inozyme, Zinzyme, G-cleaver, Amberzyme, DNAzyme Hairpin or Hammerhead 
configuration. 

In one embodiment, an enzymatic nucleic acid molecule of the invention comprises 
between 12 and 100 bases complementary to a RNA sequence encoding HTV env. In 
another embodiment, an enzymatic nucleic acid molecule of the invention comprises 
1 5 between 14 and 24 bases complementary to a RNA sequence encoding HIV env. 

In one embodiment, the Hammerhead of the invention comprises a sequence 
selected from the group consisting of SEQ ID NOs 505-561. 

In one embodiment, the Inozyme of the invention comprises a sequence selected 
from the group consisting of SEQ ID NOs. 562-637. 

20 In one embodiment, the G-cleaver of the invention comprises a sequence selected 

from the group consisting of SEQ ID NOs. 638-661. In one embodiment, the Zinzyme of 
the invention comprises a sequence selected from the group consisting of SEQ ID NOs. 
662-705. 

In one embodiment, the DNAzyme of the invention comprises a sequence selected 
25 from the group consisting of SEQ ID NOs. 706-806. 

In one embodiment, the Amberzyme of the invention comprises a sequence selected 
from the group consisting of SEQ ID NOs 807-905. 
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In one embodiment, the antisense moleeule of the invention comprises a sequence 
complementary to a sequence of SEQ ID NOs. 1-395. In another embodiment, the 
antisense molecule of the invention comprises a sequence selected from the group 
consisting of SEQ ID Nos. 906-1014. 
5 In one embodiment, the siRNA molecule of the invention comprises a sequence 

complementary to a sequence of SEQ ID NOs. 1-395. In another embodiment, the 
siRNA molecule of the invention comprises a duplex of sequences selected from the 
group consisting of SEQ ID Nos. 1015-1232. 

In another embodiment, a nucleic acid molecule of the invention is chemically 
10 synthesized. A nucleic acid molecule of the invention can comprise at least one 2'-sugar 
modification, at least one nucleic acid base modification, and/or at least one phosphate 
backbone modification. 

hi one embodiment the present invention features a mammalian cell comprising a 
nucleic acid molecule of the invention. In one embodiment, the mammalian cell of the 
1 5 invention is a human cell. 

The invention features a method of reducing HIV activity in a cell comprising 
contacting the cell with a nucleic acid molecule of the invention under conditions suitable 
for the reduction of HTV activity. 

The invention also features a method of treating a patient having a condition 
20 associated with the level of HW comprising contacting cells of the patient with a nucleic 
acid molecule of the invention under conditions suitable for the treatment. 

In one embodiment, methods of treatment contemplated by the invention comprise 
the use of one or more drug therapies under conditions suitable for the treatment. 

The invention features a method of cleaving RNA of a HIV env gene comprising 
25 contacting a nucleic acid molecule of the invention with the RNA of HIV env gene under 
conditions suitable for the cleavage. In one embodiment, the cleavage contemplated by 
the invention is carried out in the presence of a divalent cation, for example Mg2+. 
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m another embodiment, the nucleic acid molecule of the invention comprises a cap 
structure, wherein the cap structure is at the S'-end, or 3>-end, or both the 5'-end and the 
3>-e„d of the enzymatic nucleic acid molecule, for example, a 3\3'-linked or 5',5'-hnked 
deoxyabasic ribose derivative. 

5 The present invention features an expression vector comprising a nucleic acid 

sequence encoding at least one nucleic acid molecule of the invention in a manner which 
allows expression of the nucleic acid molecule. 

The invention also features a mammalian cell, for example, a human cell 
comprising an expression vector contemplated by the invention. 

10 In one embodiment, an expression vector of the invention comprises a nucleic acid 

sequence encoding two or more nucleic acid molecules, which may be the same or 
different. 

The present invention features a method for treatment of acquired 
immunodeficiency syndrome (AIDS) or an AIDS related condition, for example Kaposi's 
15 sarcoma, lymphoma, cervical cancer, squamous cell carcinoma, cardiac myopathy, 
rheumatic disease, or opportunistic infection, comprising administering to a patient a 
nucleic acid molecule of the invention under conditions suitable for the treatment. 

In one embodiment, a nucleic acid molecule of the invention comprises at least five 
ribose residues, at least ten 2'-0-methyl modifications, and a 3'- end modification, for 
20 example, a 3 '-3 'inverted abasic moiety. 

In another embodiment, a nucleic acid molecule of the invention further comprises 
phosphorothioate linkages on at least three of the 5' terminal nucleotides. 

In yet another embodiment, a DNAzyme of the invention comprises at least ten T- 
O-methyl modifications and a 3'-end modification, for example a 3'-3' inverted abasic 
25 moiety. In a further embodiment, the DNAzyme of the invention further comprises 
phosphorothioate linkages on at least three of the 5' terminal nucleotides. 
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In another embodiment, other drug therapies of the invention comprise antiviral 
therapy, monoclonal antibody therapy, chemotherapy, radiation therapy, analgesic 
therapy, or anti-inflammatory therapy. 

In yet another embodiment, antiviral therapy of the invention comprises treatment 
with zidovudine (AZT), zalcitabine (DDC), zidovudine (ZDV), lamivudine (3TC), 
didanosinedelavirdine (DDI), stavudine (D4T), abacavir, efavirenz, nevirapine, or 
tenofovir disoproxil fumarate, ribavirin and/or protease inhibitors such as indinavir, 
amprenavir, saquinavir, lopinavir, ritonavir, or nelfinavir, or any combination thereof. 

The invention features a composition comprising a nucleic acid molecule of the 
invention in a pharmaceutically acceptable carrier. 

In one embodiment, the invention features a method of administering to a cell, for 
example a mammalian cell or human cell, a nucleic acid molecule of the invention 
comprising contacting the cell with the nucleic acid molecule under conditions suitable 
for the administration. The method of adminislration can be in the presence of a delivery 
reagent, for example, a lipid, cationic lipid, phospholipid, or liposome. 

The term "antifusogenic" as used herein refers to the ability of a compound to 
inhibit or reduce the level of membrane fusion events between two or more moieties 
relative to the level of membrane fusion which occurs between the moieties in the 
absence of the compound. The moieties can be, for example, cell membranes or viral 
structures, such as viral envelopes or pili. Antifusogenic compounds can exert their effect 
by modulating protein-protein interactions or by modulating intracellular events involving 
coiled-coil peptide structures. 

The term "antiviral" as used herein refers to the ability of a compound to inhibit or 
reduce viral infection of cells, for example, by inhibiting cell-cell fusion or free virus 
infection. The antiviral activity of the compound can result from antifusogenic activity or 
by preventing viral replication and/or expression, such as by modulating the expression of 
the viral genome. 

The term "modulate" as used herein refers to a stimulatory or inhibitory effect on 
the intracellular or intercellular process of interest relative to the level or activity of such 
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a process in the absense of a nucleic acid molecule of the invention. For example, the 
level of membrane fusion events between two or more moieties is enhanced or decreased 
in the presence of a modulator relative to the level of membrane fusion which occurs 
between the moieties in the absence of the modulator. In another non-limiting example, 
the expression of the gene, or level of RNA molecules or equivalent RNA molecules 
encoding one or more proteins or protein subunits, or activity of one or more proteins or 
protein subunits is up regulated or down regulated, such that expression, level, or activity 
is greater than or less than that observed in the absence of the modulator. For example, 
the term "modulate" can mean "inhibit," but the use of the word "modulate" is not limited 
to this definition. 

The term "inhibit" as used herein refers to when the activity of HIV envelope 
glycoprotein, or level of KNAs or equivalent RNAs encoding one or more protein 
subunits of HTV envelope glycoprotein or functional equivalents thereof, is reduced 
below that observed in the absence of the nucleic acid of the invention. In one 
embodiment, inhibition with nucleic acid molecule preferably is below that level 
observed in the presence of non-binding or an inactive or attenuated molecule that is 
unable to bind to the same target site. In another embodiment, inhibition of HIV gene 
expression, cell fusion or cell entry with the nucleic acid molecule of the instant invention 
is greater in the presence of the nucleic acid molecule than in its absence. 

The methods of this invention can be used to treat HIV infections, which include 
productive virus infection, latent or persistent virus infection. The utility can be extended 
to other species of HIV that infect non-human animals where such infections are of 
veterinary importance. 

By "aptamer" or "nucleic acid aptamer" as used herein is meant a nucleic acid 
molecule that binds specifically to a target molecule. The target molecule can be any 
molecule of interest. For example, the aptamer can be used to bind to a ligand-binding 
domain of a protein, thereby preventing interaction of the naturally occurring ligand with 
the protein. The aptamer can also be used to prevent protein-protein interactions or 
conformational changes within a protein by binding to a portion of a target protein that 
interacts with another protein or with another portion of the same protein. This is a non- 
limiting example and those in the art will recognize that other embodiments can be 
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readily generated using techniques generally known in the art, see for example Gold et 
al. 1995, Anna. Rev. Biochem., 64, 763; Brody and Gold, 2000, J. Biotechnoi, 74, 5; 
Sun, 2000, Cun: Opm. Mol. Then, 2, 100; Kusser, 2000, J. Biotechnoi, 74, 27; Hermann 
and Patel, 2000, Science, 287, 820; and Jayasena, 1999, Clinical Chemistry, 45, 1628. 

5 By "enzymatic nucleic acid molecule" is meant a nucleic acid molecule that has 

complementarity in a substrate binding region to a specified gene target, and also has an 
enzymatic activity which is active to specifically cleave a target RNA or DNA molecule. 
That is, the enzymatic nucleic acid molecule is able to intermolecularly cleave a RNA or 
DNA molecule and thereby inactivate a target RNA or DNA molecule. These 
10 complementary regions allow sufficient hybridization of the enzymatic nucleic, acid 
molecule to a target RNA molecule and thus permit cleavage. One hundred percent 
complementarity is preferred, but complementarity as low as 50-75% may also be useful 
in this invention (see for example Werner and Uhlenbeck, 1995, Nucleic Acids Research, 
23, 2092-2096; Hammann et al, 1999, Antisense and Nucleic Acid Drug Dev., 9, 25-31). 
15 The nucleic acids can be modified at the base, sugar, and/or phosphate groups. The term 
enzymatic nucleic acid is used interchangeably with phrases such as ribozymes, catalytic 
RNA, enzymatic RNA, catalytic DNA, aptazyme or aptamer-binding ribozyme, 
regulable ribozyme, catalytic oligonucleotides, nucleozyme, DNAzyme, RNA enzyme, 
endoribonuclease, endonuclease, minizyme, leadzyme, oligozyme or DNA enzyme. All 
20 of these terminologies describe nucleic acid molecules with enzymatic activity. The 
specific enzymatic nucleic acid molecules described in the instant application are not 
limiting in the invention and those skilled in the art will recognize that all that is 
important in an enzymatic nucleic acid molecule of this invention is that it have a specific 
substrate binding site which is complementary to one or more of the target nucleic acid 
25 regions, and that it have nucleotide sequences within or surrounding that substrate 
binding site which impart a nucleic acid cleaving activity to the molecule (Cech et al., 
U.S. Patent No. 4,987,071; Cech et al, 1988, JAMA 260:20 3030-4). 

By "nucleic acid molecule" as used herein is meant a molecule comprising 
nucleotides. The nucleic acid can be single, double, or multiple stranded and can 
30 comprise modified or unmodified nucleotides or non-nucleotides or various mixtures and 
combinations thereof. 
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By "Inozyme" or "NCH" motif or configuration is meant, an enzymatic nucleic 
acid molecule comprising a motif as is generally described as NCH Rz in Ludwig et al, 
International PCT Publication No. WO 98/58058 and US Patent Application Serial No. 
08/878,640, winch is herein incorporated by reference in its entirety including the 
5 drawings. Inozymes possess endonuclease activity to cleave RNA substrates having a 
cleavage triplet NCH/, where N is a nucleotide, C is cytidine and H is adenosine, uridine 
or cytidine, and / represents the cleavage site. Inozymes can also possess endonuclease 
activity to cleave RNA substrates having a cleavage triplet NCN/, where N is a 
nucleotide, C is cytidine, and / represents the cleavage site 

10 By "G-cleaver" motif or configuration is meant, an enzymatic nucleic acid 

molecule comprising a motif as is generally described in Eckstein et al, US 6,127,173, 
which is herein incorporated by reference in its entirety including the drawings, and in 
Kore et al, 1998, Nucleic Acids Research 26, 4116-4120. G-cleavers possess 
endonuclease activity to cleave RNA substrates having a cleavage triplet NYN/, where N 

15 is a nucleotide, Y is uridine or cytidine and / represents the cleavage site. G-cleavers can 
be chemically modified. 

By "zinzyme" motif or configuration is meant, an enzymatic nucleic acid molecule 
comprising a motif as is generally described in Beigelman et al, International PCT 
publication No. WO 99/55857 and US Patent Application Serial No. 09/918,728, which is 
20 herein incorporated by reference in its entirety including the drawings. Zinzymes possess 
endonuclease activity to cleave RNA substrates having a cleavage triplet including but 
not limited to, YG/Y, where Y is uridine or cytidine, and G is guanosine and / represents 
the cleavage site. Zinzymes can be chemically modified to increase nuclease stability 
through various substitutions, including substituting 2'-0-methyl guanosine nucleotides 
25 for guanosine nucleotides. In addition, differing nucleotide and/or non-nucleotide linkers 
can be used to substitute the 5'-gaaa-2> loop of the motif. Zinzymes represent a non- 
limiting example of an enzymatic nucleic acid molecule that does not require a 
ribonucleotide (2'-OH) group within its own nucleic acid sequence for activity. 

By "amberzyme" motif or configuration is meant, an enzymatic nucleic acid 
30 molecule comprising a motif as is generally described in Beigelman et al, International 
PCT publication No. WO 99/55857 and US Patent Application Serial No. 09/476,387, 
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which is herein incorporated by reference in its entirety including the drawings. 
Amberzymes possess endonuclease activity to cleave RNA substrates having a cleavage 
triplet NG/N, where N is a nucleotide, G is guanosine, and / represents the cleavage site. 
Amberzymes can be chemically modified to increase nuclease stability. In addition, 
5 differing nucleoside and/or non-nucleoside linkers can be used to substitute the 5'-gaaa-3' 
loops of the motif. Amberzymes represent a non-limiting example of an enzymatic 
nucleic acid molecule that does not require a ribonucleotide (2'-OH) group within its own 
nucleic acid sequence for activity. 

By 'DNAzyme' is meant, an enzymatic nucleic acid molecule that does not require 
10 the presence of a 2'-OH group within its own nucleic acid sequence for activity. In 
particular embodiments, the enzymatic nucleic acid molecule can have an attached linker 
or linkers or other attached or associated groups, moieties, or chains containing one or 
more nucleotides with 2*-OH groups. DNAzymes can be synthesized chemically or 
expressed endogenously in vivo, by means of a single stranded DNA vector or equivalent 
15 thereof. Non-limiting examples of DNAzymes are generally reviewed in Usman et al, 
US patent No. 6,159,714, which is herein incorporated by reference in its entirety 
including the drawings; Chartrand et al, 1995, NAR 23, 4092; Breaker et al, 1995, Chem. 
Bio. 2, 655; Santoro et al, 1997, PNAS 94, 4262; Breaker, 1999, Nature Biotechnology, 
17, 422-423; and Santoro et. al, 2000, J. Am. Chem. Soc, 122, 2433-39. The "10-23" 
20 DNAzyme motif is one particular type of DNAzyme that was evolved using in vitro 
selection as generally described in Joyce et al, US 5,807,718 and Santoro et al, supra. 
Additional DNAzyme motifs can be selected for using techniques similar to those 
described in these references, and hence, are within the scope of the present invention. 

By "nucleic acid sensor molecule" or "allozyme" as used herein is meant a nucleic 
25 acid molecule comprising an enzymatic domain and a sensor domain, where the ability of 
the enzymatic nucleic acid domain's ability to catalyze a chemical reaction is dependent 
on the interaction with a target signaling molecule, such as a nucleic acid, polynucleotide, 
oligonucleotide, peptide, polypeptide, or protein, for example HTV-1 envelope 
glygoprotein, gp41, or gpl20. The introduction of chemical modifications, additional 
30 functional groups, and/or linkers, to the nucleic acid sensor molecule can provide 
enhanced catalytic activity of the nucleic acid sensor molecule, increased binding affinity 
of the sensor domain to a target nucleic acid, and/or improved nuclease/chemical stability 
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of the nucleic acid sensor molecule, and are hence within the scope of the present 
invention (see for example Usman et al, US Patent Application No. 09/877,526, George 
et al, US Patent Nos. 5,834,186 and 5,741,679, Shih et al, US Patent No. 5,589,332, 
Nathan et al, US Patent No 5,871,914, Nathan and Ellington, International PCT 
5 publication No. WO 00/24931, Breaker et al, International PCT Publication Nos. WO 
00/26226 and 98/27104, and Sullenger et al, US Patent Application Serial No. 
09/205,520). 

By "sensor component" or "sensor domain" of the nucleic acid sensor molecule as 
used herein is meant, a nucleic acid sequence (e.g., RNA or DNA or analogs thereof) 
10 which interacts with a target signaling molecule, for example a nucleic acid sequence in 
one or more regions of a target nucleic acid molecule or more than one target nucleic acid 
molecule, and which interaction causes the enzymatic nucleic acid component of the 
nucleic acid sensor molecule to either catalyze a reaction or stop catalyzing a reaction. In 
the presence of target signaling molecule of the invention, such as HIV-1 envelope 
15 glycoprotein or portions thereof such as gp41 and/or gpl20, the ability of the sensor 
component, for example, to modulate the catalytic activity of the nucleic acid sensor 
molecule, is modulated or diminished. The sensor component can comprise recognition 
properties relating to chemical or physical signals capable of modulating the nucleic acid 
sensor molecule via chemical or physical changes to the structure of the nucleic acid 
20 sensor molecule. The sensor component can be derived from a naturally occurring 
nucleic acid binding sequence, for example, RNAs that bind to other nucleic acid 
sequences in vivo. Alternately, the sensor component can be derived from a nucleic acid 
molecule (aptamer), which is evolved to bind to a nucleic acid sequence within a target 
nucleic acid molecule. The sensor component can be covalently linked to the nucleic 
25 acid sensor molecule, or can be non-covalently associated. A person skilled in the art 
will recognize that all that is required is that the sensor component is able to selectively 
modulate the activity of the nucleic acid sensor molecule to catalyze a reaction. 

By "target molecule" or "target signaling molecule" is meant a molecule capable 
of interacting with a nucleic acid sensor molecule, specifically a sensor domain of a 
30 nucleic acid sensor molecule, in a manner that causes the nucleic acid sensor molecule to 
be active or inactive. The interaction of the signaling agent with a nucleic acid sensor 
molecule can result in modification of the enzymatic nucleic acid component of the 
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nucleic acid sensor molecule via chemical, physical, topological, or conformational 
changes to the structure of the molecule, such that the activity of the enzymatic nucleic 
acid component of the nucleic acid sensor molecule is modulated, for example is 
activated or deactivated. Signaling agents can comprise target signaling molecules such 

5 as macromolecules, ligands, small molecules, metals and ions, nucleic acid molecules 
including but not limited to RNA and DNA or analogs thereof, proteins, peptides, 
antibodies, polysaccharides, lipids, sugars, microbial or cellular metabolites, 
pharmaceuticals, and organic and inorganic molecules in a purified or unpurified form, 
for example HIV envelope glycoprotein or portions thereof such as gp41, gp!20, and/or 

10 peptide sequences such as SEQ ID Nos 1233 and 1234 or analogs thereof. 

By "sufficient length" is meant a nucleic acid molecule long enough to provide the 
intended function under the expected condition. For example, a nucleic acid molecule of 
the invention needs to be of "sufficient length" to provide stable binding to a target site 
under the expected binding conditions and environment. In another non-limiting 

15 example, for the binding arms of an enzymatic nucleic acid, "sufficient length" means 
that the binding arm sequence is long enough to provide stable binding to a target site 
under the expected reaction conditions and environment. The binding arms are not so 
long as to prevent useful turnover of the nucleic acid molecule. By "stably interact" is 
meant interaction of the oligonucleotides with target, such as a target protein or target 

20 nucleic acid (e.g., by forming hydrogen bonds with complementary amino acids or 
nucleotides in the target under physiological conditions) that is sufficient for the intended 
purpose (e.g., specific binding to a protein target to disrupt the function of that protein or 
cleavage of target RNA/DNA by an enzyme). 

By "homology" is meant the nucleotide sequence of two or more nucleic acid 
25 molecules, or the amino acid sequence of two or more proteins, is partially or completely 
identical. 

By "antisense nucleic acid", it is meant a non-enzymatic nucleic acid molecule that 
binds to target RNA by means of RNA-RNA or RNA-DNA or RNA-PNA (protein 
nucleic acid; Egholm et al, 1993 Nature 365, 566) interactions and alters the activity of 
30 the target RNA (for a review, see Stein and Cheng, 1993 Science 261, 1004 and Woolf et 
al. US patent No. 5,849,902). Typically, antisense molecules are complementary to a 
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target sequence along a single contiguous sequence of the antisense molecule. However, 
in certain embodiments, an antisense molecule can bind to substrate such that the 
substrate molecule forms a loop, and/or an antisense molecule can bind such that the 
antisense molecule forms a loop. Thus, the antisense molecule can be complementary to 

5 two or more non-contiguous substrate sequences or two or more non-contiguous sequence 
portions of an antisense molecule can be complementary to a target sequence, or both. 
For a review of current antisense strategies, see Schmajuk et al., 1999, J. Biol. Chan., 
274, 21783-21789, Delihas et al, 1997, Nature, 15, 751-753, Stein et al, 1997, Antisense 
N. A. Drug Dev., 7, 151, Crooke, 2000, Methods Enzymol, 313, 3-45; Crooke, 1998, 

10 Biotech. Genet. Eng. Rev., 15, 121-157, Crooke, 1997, Ad. Pharmacol, 40, 1-49. 
Antisense molecules of the instant invention can include 2-5A antisense chimera 
molecules. In addition, antisense DNA can be used to target RNA by means of DNA- 
RNA interactions, thereby activating RNase H, which digests the target RNA in the 
duplex. The antisense oligonucleotides can comprise one or more RNAse H activating 

15 region that is capable of activating RNAse H cleavage of a target RNA. Antisense DNA 
can be synthesized chemically or expressed via the use of a single stranded DNA 
expression vector or equivalent thereof. 

By "RNase H activating region" is meant a region (generally greater than or equal 
to 4-25 nucleotides in length, preferably from 5-11 nucleotides in length) of a nucleic acid 
20 molecule capable of binding to a target RNA to form a non-covalent complex that is 
recognized by cellular RNase H enzyme (see for example Arrow et al, US 5,849,902; 
Arrow et al, US 5,989,912). The RNase H enzyme binds to the nucleic acid molecule- 
target RNA complex and cleaves the target RNA sequence. The RNase H activating 
region comprises, for example, phosphodiester, phosphorothioate (for example, at least 
25 four of the nucleotides are phosphorothiote substitutions; more specifically, 4-1 1 of the 
nucleotides are phosphorothiote substitutions), phosphorodithioate, 5'-thiophosphate, or 
methylphosphonate backbone chemistry or a combination thereof. In addition to one or 
more backbone chemistries described above, the RNase H activating region can also 
comprise a variety of sugar chemistries. For example, the RNase H activating region can 
30 comprise deoxyribose, arabino, fluoroarabino or a combination thereof, nucleotide sugar 
chemistry. Those skilled in the art will recognize that the foregoing are non-limiting 
examples and that any combination of phosphate, sugar and base chemistry of a nucleic 
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acid that supports the activity of RNase H enzyme is within the scope of the definition of 
the RNase H activating region and the instant invention. 

By "2-5A antisense chimera" it is meant, an antisense oligonucleotide containing a 
S'-phosphorylated 2'-5'-linked adenylate residue. These chimeras bind to target RNA in a 
5 sequence-specific manner and activate a cellular 2-5A-dependent ribonuclease, which, in 
turn, cleaves the target RNA (Torrence et al, 1993 Proc. Natl. Acad. Sci. USA 90, 1300). 

By "triplex nucleic acid" or "triplex oligonucleotide" it is meant a polynucleotide or 
oligonucleotide that can bind to a double-stranded DNA in a sequence-specific manner to 
form a triple-strand helix. Formation of such triple helix structure has been shown to 
10 modulate transcription of the targeted gene (Duval-Valentin et al., 1992, Proc. Natl. 
Acad. Sci.USA, 89, 504). Triplex nucleic acid molecules of the invention also include 
steric blocker nucleic acid molecules that bind to the Enhancer I region of HBV DNA 
(plus strand and/or minus strand) and prevent translation of HBV genomic DNA. 

The term "short interfering nucleic acid", "siNA", "short interfering RNA", 
15 "siRNA", "short interfering nucleic acid molecule", "short interfering oligonucleotide 
molecule", or "chemically-modified short interfering nucleic acid molecule" as used 
herein refers to any nucleic acid molecule capable of inhibiting or down regulating gene 
expression or viral replication, for example by mediating RNA interference "RNAi" or 
gene silencing in a sequence-specific maimer; see for example Bass, 2001, Nature, 411, 
20 428-429; Elbashir et al, 2001, Nature, 411, 494-498; and Kreutzer et al, International 
PCT Publication No. WO 00/44895; Zemicka-Goetz et al, International PCT Publication 
No. WO 01/36646; Fire, International PCT Publication No. WO 99/32619; Plaetinck et 
al, International PCT Publication No. WO 00/01846; Mello and Fire, International PCT 
Publication No. WO 01/29058; Deschamps-Depaillette, International PCT Publication 
25 No. WO 99/07409; and Li et al, International PCT Publication No. WO 00/44914; 
Allshire, 2002, Science, 297, 1818-1819; Volpe et al, 2002, Science, 297, 1833-1837; 
Jenuwein, 2002, Science, 297, 2215-2218; and Hall etal, 2002, Science, 297, 2232-2237; 
Hutvagner and Zamore, 2002, Science, 297, 2056-60; McManus et al, 2002, RNA, 8, 
842-850; Reinhart et al, 2002, Gene & Dev., 16, 1616-1626; and Reinhart & Bartel, 
30 2002, Science, 297, 1831). For example the siNA can be a double-stranded 
polynucleotide molecule comprising self-complementary sense and antisense regions, 
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wherein the antisense region comprises nucleotide sequence that is complementary to 
nucleotide sequence in a target nucleic acid molecule or a portion thereof and the sense 
region having nucleotide sequence corresponding to the target nucleic acid sequence or a 
portion thereof. The siNA can be assembled from two separate oligonucleotides, where 
5 one strand is the sense strand and the other is the antisense strand, wherein the antisense 
and sense strands are self-complementary (i.e. each strand comprises nucleotide sequence 
that is complementary to nucleotide sequence in the other strand; such as where the 
antisense strand and sense strand form a duplex or double stranded structure, for example 
wherein the double stranded region is about 19 base pairs); the antisense strand comprises 
10 nucleotide sequence that is complementary to nucleotide sequence in a target nucleic acid 
molecule or a portion thereof and the sense strand comprises nucleotide sequence 
corresponding to the target nucleic acid sequence or a portion thereof. Alternatively, the 
siNA is assembled from a single oligonucleotide, where the self-complementary sense 
and antisense regions of the siNA are linked by means of a nucleic acid based or non- 
15 nucleic acid-based linker(s). The siNA can be a polynucleotide with a hairpin secondary 
structure, having self-complementary sense and antisense regions, wherein the antisense 
region comprises nucleotide sequence that is complementary to nucleotide sequence in a 
separate target nucleic acid molecule or a portion thereof and the sense region having 
nucleotide sequence corresponding to the target nucleic acid sequence or a portion 
20 thereof. The siNA can be a circular single-stranded polynucleotide having two or more 
loop structures and a stem comprising self-complementary sense and antisense regions, 
wherein the antisense region comprises nucleotide sequence that is complementary to 
nucleotide sequence in a target nucleic acid molecule or a portion thereof and the sense 
region having nucleotide sequence corresponding to the target nucleic acid sequence or a 
25 portion thereof, and wherein the circular polynucleotide can be processed either in vivo or 
in vitro to generate an active siNA molecule capable of mediating RNAi. The siNA can 
also comprise a single stranded polynucleotide having nucleotide sequence 
complementary to nucleotide sequence in a target nucleic acid molecule or a portion 
thereof (for example, where such siNA molecule does not require the presence within the 
30 siNA molecule of nucleotide sequence corresponding to the target nucleic acid sequence 
or a portion thereof), wherein the single stranded polynucleotide can further comprise a 
terminal phosphate group, such as a 5'-phosphate (see for example Martinez et al, 2002, 
Cell, 110, 563-574 and Schwarz et al, 2002, Molecular Cell, 10, 537-568), or 5\3'- 
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diphosphate, hi certain embodiment, the siNA molecule of the invention comprises 
separate sense and antisense sequences or regions, wherein the sense and antisense 
regions are covalently linked by nucleotide or non-nucleotide linkers molecules as is 
known in the art, or are alternately non-covalently linked by ionic interactions, hydrogen 
bonding, van der waals interactions, hydrophobic intercations, and/or stacking 
interactions. In certain embodiments, the siNA molecules of the invention comprise 
nucleotide sequence that is complementary to nucleotide sequence of a target gene. In 
another embodiment, the siNA molecule of the invention interacts with nucleotide 
sequence of a target gene in a manner that causes inhibition of expression of the target 
gene. As used herein, siNA molecules need not be limited to those molecules containing 
only RNA, but further encompasses chemically-modified nucleotides and non- 
nucleotides. hi certain embodiments, the short interfering nucleic acid molecules of the 
invention lack 2'-hydroxy (2'-OH) containing nucleotides. Applicant describes in certain 
embodiments short interfering nucleic acids that do not require the presence of 
nucleotides having a 2 , -hydroxy group for mediating RNAi and as such, short interfering 
nucleic acid molecules of the invention optionally do not include any ribonucleotides 
(e.g., nucleotides having a 2'-OH group). Such siNA molecules that do not require the 
presence of ribonucleotides within the siNA molecule to support RNAi can however have 
an attached linker or linkers or other attached or associated groups, moieties, or chains 
containing one or more nucleotides with 2'-OH groups. Optionally, siNA molecules can 
comprise ribonucleotides at about 5, 10, 20, 30, 40, or 50% of the nucleotide positions. 
The modified short interfering nucleic acid molecules of the invention can also be 
referred to as short interfering modified oligonucleotides "siMON." As used herein, the 
term siNA is meant to be equivalent to other terms used to describe nucleic acid 
molecules that are capable of mediating sequence specific RNAi, for example short 
interfering RNA (siRNA), double-stranded RNA (dsRNA), micro-RNA (miRNA), short 
hairpin RNA (shRNA), short interfering oligonucleotide, short interfering nucleic acid, 
short interfering modified oligonucleotide, chemically-modified siRNA, post- 
transcriptional gene silencing RNA (ptgsRNA), and others. In addition, as used herein, 
the term RNAi is meant to be equivalent to other terms used to describe sequence specific 
RNA interference, such as post transcriptional gene silencing, or epigenetics. For 
example, siNA molecules of the invention can be used to epigenetically silence genes at 
both the post-transcriptional level or the pre-transcriptional level. In a non-limiting 
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example, epigenetic regulation of gene expression by siNA molecules of the invention 
can result from siNA mediated modification of chromatin structure to alter gene 
expression (see, for example, Allshire, 2002, Science, 297, 1818-1819; Volpe et al, 2002, 
Science, 297, 1833-1837; Jenuwein, 2002, Science, 297, 2215-2218; and Hall et al, 2002, 
5 Science, 297, 2232-2237). 

By "gene" it is meant, a nucleic acid that encodes an RNA, for example, nucleic 
acid sequences including, but not limited to, structural genes encoding a polypeptide. 

By "complementarity" is meant that a nucleic acid can form hydrogen bond(s) with 
another nucleic acid sequence by either traditional Watson-Crick or other non-traditional 

10 types. In reference to the nucleic molecules of the present invention, the binding free 
energy for a nucleic acid molecule with its target or complementary sequence is sufficient 
to allow the relevant function of the nucleic acid to proceed, e.g., ribozyme cleavage, 
antisense or triple helix modulation. Determination of binding free energies for nucleic 
acid molecules is well known in the art (see, e.g., Turner et al., 1987, CSH Symp. Quant. 

15 Biol. UL pp.123-133; Frier et al., 1986, Proc. Nat. Acad. Sci. USA 83:9373-9377; Turner 
et al., 1987, J. Am. Client. Soc. 109:3783-3785). A percent complementarity indicates the 
percentage of contiguous residues in a nucleic acid molecule that can form hydrogen 
bonds (e.g., Watson-Crick base pairing) with a second nucleic acid sequence (e.g., 5, 6, 7, 
8, 9, 10 out of 10 being 50%, 60%, 70%, 80%, 90%, and 100% complementary). 

20 "Perfectly complementary" means that all the contiguous residues of a nucleic acid 
sequence will hydrogen bond with the same number of contiguous residues in a second 
nucleic acid sequence. 

The nucleic acid aptamers that bind to a HIV envelope glycoprotein and therefore 
inactivate the cellular fusion and entry represent a novel therapeutic approach to treat 
25 HIV infection, AIDS and related conditions. 

In one embodiment of the present invention, an aptamer nucleic acid molecule of 
the invention is about 4 to about 50 nucleotides in length, in specific embodiments about 
4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 
30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 
30 nucleotides in length. In another embodiment, an enzymatic nucleic acid molecule of the 
invention, e.g., a ribozyme or DNAzyme, is about 13 to about 100 nucleotides in length, 
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e.g., in specific embodiments about 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 
36, 37, or 38 nucleotides in length. In another embodiment, an antisense nucleic acid 
molecule, 2,5-A chimera, or triplex oligonucleotide of the invention is about 13 to about 
100 nucleotides in length, e.g., in specific embodiments about 22, 23, 24, 25, 26, 27, 28, 
29, 30, 31, 32, 33, 34, 35, 36, 37, or 38 nucleotides in length. In another embodiment, a 
siRNA molecule of the invention is about 18 to about 24 nucleotides in length (such as 
where each strand of siRNA duplex is about 18 to about 24 nucleotides in length), e.g., in 
specific embodiments, each strand of the siRNA duplex is about 18, 19, 20, 21, 22, 23, or 
24 nucleotides in length. In yet another embodiment, a siRNA molecule of the invention 
has 2 3'-nucleotide overhangs on each strand of the duplex, for example two thymidine 
(TT) nucleotide overhangs. In particular embodiments, instead of 100 nucleotides being 
the upper limit on the length ranges specified above, the upper limit of the length range 
can be, for example, 30, 40, 50, 60, 70, or 80 nucleotides. Thus, for any of the length 
ranges, the length range for particular embodiments has lower limit as specified, with an 
upper limit as specified which is greater than the lower limit. For example, in a particular 
embodiment, the length range can be 20-50 nucleotides in length. All such ranges are 
expressly included. Also in particular embodiments, a nucleic acid molecule can have a 
length which is any of the lengths specified above, for example, 21 nucleotides in length. 

Aptamer molecules of the invention are about 4 to about 50 nucleotides in length. 
Exemplary siRNA molecules of the invention are about 18 to about 24 nucleotides in 
length for each strand of the siRNA duplex. In an additional example, enzymatic nucleic 
acid molecules of the invention are preferably about 15 to about 50 nucleotides in length, 
more preferably about 25 to about 40 nucleotides in length, e.g., 34, 36, or 38 nucleotides 
in length (for example see Jarvis et at, 1996, J. Biol. Chem., 271, 29107-29112). 
Exemplary DNAzymes of the invention are preferably about 15 to about 40 nucleotides in 
length. In one embodiment, exemplary DNAzymes are about 25 to about 35 nucleotides 
in length, e.g., 29, 30, 31, or 32 nucleotides in length (see for example Santoro et al, 
1998, Biochemistty, 37, 13330-13342; Chartrand et al, 1995, Nucleic Acids Research, 
23, 4092-4096). Exemplary antisense molecules of the invention are about 15 to about 75 
nucleotides in length. In one embodiment, exemplary antisense molecules are about 20 to 
about 35 nucleotides in length, e.g., 25, 26, 27, or 28 nucleotides in length (see for 
example Woolf et al, 1992, PNAS., 89, 7305-7309; Milner et al, 1997, Nature 
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Biotechnology, 15, 537-541). Exemplary triplex forming oligonucleotide molecules of 
the invention are about 10 to about 40 nucleotides in length. In one embodiment, 
exemplary triplex forming oligonucleotide molecules are about 12 to about 25 nucleotides 
in length, e.g., 18, 19, 20, or 21 nucleotides in length (see for example Maher et al, 1990, 
5 Biochemistry, 29, 8820-8826; Strobel and Dervan, 1990, Science, 249, 73-75). Those 
skilled in the art will recognize that all that is required is that the nucleic acid molecule is 
of length and conformation sufficient and suitable for the nucleic acid molecule to 
catalyze a reaction contemplated herein. The length of the nucleic acid molecules of the 
instant invention are not limiting within the general limits stated. 

10 In one embodiment, the invention provides a method for producing a class of 

nucleic acid aptamers which exhibit a high degree of specificity for a HIV envelope 
glycoprotein such as a site within the gp41 region of HIV envelope glycoprotein. In 
another embodiment, the invention provides a method for producing a class of nucleic 
acid based gene modulating agents which exhibit a high degree of specificity for HIV 
15 nucleic acid sequences encoding the HIV envelope glycoprotein. For example, the 
nucleic acid gene modulating molecule is preferably targeted to a highly conserved region 
of the HIV env gene such that specific treatment of a disease or condition can be provided 
with either one or several nucleic acid molecules of the invention. Alternately, the 
nucleic acid aptamer molecule is preferably targeted to a highly conserved region of the 
20 HIV envelope glycoprotein such that specific treatment of a disease or condition can be 
provided with either one or several nucleic acid molecules of the invention. Such nucleic 
acid molecules can be delivered exogenously to specific tissue or cellular targets as 
required. Alternatively, the nucleic acid molecules can be expressed from DNA and/or 
RNA vectors that are delivered to specific cells. 

25 As used herein "cell" is used in its usual biological sense, and does not refer to an 

entire multicellular organism, e.g., specifically does not refer to a human. The cell can be 
present in an organism, e.g., birds, plants and mammals such as humans, cows, sheep, 
apes, monkeys, swine, dogs, and cats. The cell can be prokaryotic (e.g., bacterial cell) or 
eukaryotic (e.g., mammalian or plant cell). 

30 By "HIV envelope glycoprotein" is meant, a protein or a mutant protein derivative 

thereof, comprising sequence expressed and/or encoded by the HIV env gene. Non- 
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limiting examples of the HIV envelope glycoprotein are represented by Genbank 
Accession Nos. AAM09869-AAM09880. HIV envelope glycoproteins contemplated by 
the invention include gpl20 and gp41. 

By "highly conserved nucleic acid binding region" is meant an amino acid sequence 
5 of one or more regions in a target protein that does not vary significantly from one 
generation to the other or from one biological system to the other. 

The enzymatic nucleic acid-based modulators of HIV fusogenic activity are useful 
for the prevention of the diseases and conditions including HTV infection, AIDS, and any 
other diseases or conditions that are related to the levels of HIV in a cell or tissue. 

10 By "related to the levels of HIV" is meant that the reduction of HTV fusogenic 

activity and cell entry and/or gene expression (specifically HTV gene) and thus reduction 
in the level of the HW expression in an organism will relieve, to some extent, the 
symptoms of the disease or condition. 

The nucleic acid-based modulators of the invention are added directly, or can be 
15 complexed with cationic lipids, packaged within liposomes, or otherwise delivered to 
target cells or tissues. The nucleic acid or nucleic acid complexes can be locally 
administered to relevant tissues ex vivo, or in vivo through injection, infusion pump or 
stent, with or without their incorporation in biopolymers. In particular embodiments, the 
nucleic acid molecules of the invention comprise sequences shown in Tables III - XI. 
20 Examples of such nucleic acid molecules consist essentially of sequences defined in the 
tables. 

In another aspect, the invention provides mammalian cells containing one or more 
nucleic acid molecules and/or expression vectors of this invention. The one or more 
nucleic acid molecules can independently be targeted to the same or different sites. 

25 hi another aspect of the invention, nucleic acid molecules of the invention are 

expressed from transcription units inserted into DNA or RNA vectors. The recombinant 
vectors are preferably DNA plasmids or viral vectors. Nucleic acid expressing viral 
vectors can be constructed based on, but not limited to, adeno-associated virus, retrovirus, 
adenovirus, or alphavirus. Preferably, the recombinant vectors capable of expressing 
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nucleic acid molecules of the invention are delivered as described above, and persist in 
target cells. Alternatively, viral vectors may be used that provide for transient expression 
of the nucleic acid molecules of the invention. Such vectors might be repeatedly 
administered as necessary. Once expressed, the nucleic acid molecules of the invention 
bind to the target protein, RNA and/or DNA and modulate its function or expression. 
Delivery of nucleic acid expressing vectors can be systemic, such as by intravenous or 
intramuscular administration, by administration to target cells ex-planted from the patient 
followed by reintroduction into the patient, or by any other means that would allow for 
introduction into the desired target cell. DNA based nucleic acid molecules of the 
invention can be expressed via the use of a single stranded DNA intracellular expression 
vector. 

By RNA is meant a molecule comprising at least one ribonucleotide residue. By 
"ribonucleotide" is meant a nucleotide with a hydroxyl group at the 2' position of a p-D- 
ribo-furanose moiety. 

By "vectors" is meant any nucleic acid- and/or viral-based technique used to 
express and/or deliver a desired nucleic acid. 

By "patient" or "subject" is meant an organism, which is a donor or recipient of 
explanted cells or the cells themselves. "Patient" also refers to an organism to which the 
nucleic acid molecules of the invention can be administered. In one embodiment, a 
patient is a mammal or mammalian cells. In another embodiment, a patient is a human or 
human cells. 

The nucleic acid molecules of the instant invention, individually, or in combination 
or in conjunction with other drugs, can be used to treat diseases or conditions discussed 
herein. For example, to treat a disease or condition associated with the levels of HTV, the 
nucleic acid molecules can be administered to a patient or can be administered to other 
appropriate cells evident to those skilled in the art, individually or in combination with 
one or more drugs under conditions suitable for the treatment. 

In a further embodiment, the described molecules, such as aptamers, siRNA, 
antisense, or enzymatic nucleic acids, can be used in combination with other known 
treatments to treat conditions or diseases discussed above. For example, the described 
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molecules could be used in combination with one or more known therapeutic agents to 
treat HIV infection and/or AIDS. Such therapeutic agents may include, but are not 
limited to, reverse transcriptase inhibitors such as zidovudine (AZT), zalcitabine (DDC), 
zidovudine (ZDV), lamivudine (3TC), didanosinedelavirdine (DDI), stavudine (D4T), 
5 abacavir, efavirenz, nevirapine, or tenofovir disoproxil fumarate, ribavirin and/or protease 
inhibitors such as indinavir, amprenavir, saquinavir, lopinavir, ritonavir, or nelfmavir, or 
any combination thereof under conditions suitable for said treatment. 

Other features and advantages of the invention will be apparent from the following 
description of the preferred embodiments thereof, and from the claims. 

10 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic design which outlines the steps involved in HIV cell fusion 
and entry. 

Figure 2 is a schematic design that shows a non-limiting example of inhibition of 
HIV cell fusion and entry. 

l5 DETAILED DESCRIPTION OF THE INVENTION 

Mechanism of action of Nucleic Acid Mole cules of the Invention 

Aptamer : Nucleic acid aptamers can be selected to specifically bind to a particular 
ligand of interest (see for example Gold et at, US 5,567,588 and US 5,475,096, Gold et 
al, 1995, Annu. Rev. Biochem., 64, 763; Brody and Gold, 2000, J. Biotechnol, 74, 5; 
20 Sun, 2000, Curr. Opin. Mol. Ther., 2, 100; Kusser, 2000, J. Biotechnol, 74, 27; Hermann 
and Patel, 2000, Science, 287, 820; and Jayasena, 1999, Clinical Chemistry, 45, 1628). 
For example, the use of in vitro selection can be applied to evolve nucleic acid aptamers 
with binding specificity for fflV envelope glycoprotein gp41, gpl20 or to any other 
portion of HIV that disrupts fusogenic activity of the virus. Nucleic acid aptamers can 
25 include chemical modifications and linkers as described herein. Nucleic apatmers of the 
invention can be double stranded or single stranded and can comprise one distinct nucleic 
acid sequence or more than one nucleic acid sequences complexed with one another. 
Aptamer molecules of the invention that bind to HIV envelope glycoprotein, for example 
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gp41, can modulate the fusogenic activity of HIV and therefore modulate cell entry and 
infectivity of the virus. 

Antisense : Antisense molecules can be modified or unmodified RNA, DNA, or 
mixed polymer oligonucleotides and primarily function by specifically binding to 
matching sequences resulting in modulation of peptide synthesis (Wu-Pong, Nov 1994, 
BioPharm, 20-33). The antisense oligonucleotide binds to target RNA by Watson Crick 
base-pairing and blocks gene expression by preventing ribosomal translation of the bound 
sequences either by steric blocking or by activating RNase H enzyme. Antisense 
molecules may also alter protein synthesis by interfering with RNA processing or 
transport from the nucleus into the cytoplasm (Mukhopadhyay & Roth, 1996, Crit. Rev. 
in Oncogenesis 7, 151-190). 

In addition, binding of single stranded DNA to RNA may result in nuclease 
degradation of the heteroduplex (Wu-Pong, supra; Crooke, supra). To date, the only 
backbone modified DNA chemistry which will act as substrates for RNase H are 
15 phosphorothioates, phosphorodithioates, and borontrifluoridates. Recently, it has been 
reported that 2'-arabino and 2'-fluoro arabino- containing oligos can also activate RNase 
H activity. 

A number of antisense molecules have been described that utilize novel 
configurations of chemically modified nucleotides, secondary structure, and/or RNase H 
20 substrate domains (Woolf et al, US 5,989,912; Thompson et al, USSN 60/082,404 
which was filed on April 20, 1998; Hartmann et al, USSN 60/101,174 which was filed 
on September 21, 1998) all of these are incorporated by reference herein in their entirety. 

Antisense DNA can be used to target RNA by means of DNA-RNA interactions, 
thereby activating RNase H, which digests the target RNA in the duplex. Antisense DNA 
25 can be chemically synthesized or can be expressed via the use of a single stranded DNA 
intracellular expression vector or the equivalent thereof. 

Tri plex Forminp Oli ponucleotides (TFO) : Single stranded oligonucleotide can be 
designed to bind to genomic DNA in a sequence specific manner. TFOs can be 
comprised of pyrimidine-rich oligonucleotides which bind DNA helices through 
30 Hoogsteen Base-pairing (Wu-Pong, supra). In addition, TFOs can be chemically 
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modified to increase binding affinity to target DNA sequences. The resulting triple helix 
composed of the DNA sense, DNA antisense, and TFO disrupts RNA synthesis by RNA 
polymerase. The TFO mechanism can result in gene expression or cell death since 
binding may be irreversible (Mukhopadhyay & Roth, supra) 

5 V-V Oligoadenvlates : The 2-5A system is an interferon-mediated mechanism for 

RNA degradation found in higher vertebrates (Mitra et al., 1996, Proc Nat Acad Sci USA 
93, 6780-6785). Two types of enzymes, 2-5A synthetase and RNase L, are required for 
RNA cleavage. The 2-5 A synthetases require double stranded RNA to form 2'-5' 
oligoadenylates (2-5A). 2-5A then acts as an allosteric effector for utilizing RNase L, 

10 which has the ability to cleave single stranded RNA. The ability to form 2-5A structures 
with double stranded RNA makes this system particularly useful for modulation of viral 
replication. 

(2'-5') oligoadenylate structures can be covalently linked to antisense molecules to 
form chimeric oligonucleotides capable of RNA cleavage (Torrence, supra). These 
15 molecules putatively bind and activate a 2-5A-dependent RNase, the 
oligonucleotide/enzyme complex then binds to a target RNA molecule which can then be 
cleaved by the RNase enzyme. The covalent attachment of 2'-5' oligoadenylate 
structures is not limited to antisense applications, and can be further elaborated to include 
attachment to nucleic acid molecules of the instant invention. 

20 Enzymatic Nucleic Acid : Several varieties of naturally occurring enzymatic RNAs 

are presently known (Doherty and Doudna, 2001, Annu. Rev. Biophys. Biomol. Struct, 
30, 457-475; Symons, 1994, Curr. Opin. Struct. Biol, 4, 322-30). In addition, several in 
vitro selection (evolution) strategies (Orgel, 1979, Proc. R. Soc. London, B 205, 435) 
have been used to evolve new nucleic acid catalysts capable of catalyzing cleavage and 

25 ligation of phosphodiester linkages (Joyce, 1989, Gene, 82, 83-87; Beaudry et al, 1992, 
Science 257, 635-641; Joyce, 1992, Scientific American 267, 90-97; Breaker et al, 1994, 
TIBTECH 12, 268; Bartel et al, 1993, Science 261:1411-1418; Szostak, 1993, TIBS 17, 
89-93; Kumar et al, 1995, FASEB 1, 9, 1183; Breaker, 1996, Curr. Op. Biotech., 7, 442; 
Santoro et al, 1997, Proc. Natl. Acad. Sci., 94, 4262; Tang et al, 1997, RNA 3, 914; 

30 Nakamaye & Eckstein, 1994, supra; Long & Uhlenbeck, 1994, supra; Ishizaka et al., 
1995, supra; Vaish et al, 1997, Biochemistiy 36, 6495). Each can catalyze a series of 
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reactions including the hydrolysis of phosphodiester bonds in trans (and thus can cleave 
other RNA molecules) under physiological conditions. 

Nucleic acid molecules of this invention can block HIV protein expression, 
specifically, HTV env protein expression, and can be used to treat disease or diagnose 
5 disease associated with the levels of HIV. 

The enzymatic nature of an enzymatic nucleic acid has significant advantages, such 
as the concentration of nucleic acid necessary to affect a therapeutic treatment is low. 
This advantage reflects the ability of the enzymatic nucleic acid molecule to act 
enzymatically. Thus, a single enzymatic nucleic acid molecule is able to cleave many 
10 molecules of target RNA. In addition, the enzymatic nucleic acid molecule is a highly 
specific modulator, with the specificity of modulation depending not only on the base- 
pairing mechanism of binding to the target RNA, but also on the mechanism of target 
RNA cleavage. Single mismatches, or base-substitutions, near the site of cleavage can be 
chosen to completely eliminate catalytic activity of an enzymatic nucleic acid molecule. 

15 Nucleic acid molecules having an endonuclease enzymatic activity are able to 

repeatedly cleave other separate RNA molecules in a nucleotide base sequence-specific 
manner. With proper design and construction, such enzymatic nucleic acid molecules can 
be targeted to any RNA transcript, and efficient cleavage achieved in vitro (Zaug et al, 
324, Nature 429 1986; Uhlenbeck, 1987 Nature 328, 596; Kim et al., 84 Proc. Natl. 

20 Acad. Sci. USA 8788, 1987; Dreyfus, 1988, Einstein Quart. J. Bio. Med., 6, 92; Haseloff 
and Gerlach, 334 Nature 585, 1988; Cech, 260 JAMA 3030, 1988; and Jefferies et al., 17 
Nucleic Acids Research 1371, 1989; Chartrand et al., 1995, Nucleic Acids Research 23, 
4092; Santoro et al, 1997, PNAS 94, 4262). 

Because of their sequence specificity, trans-cleaving enzymatic nucleic acid 
25 molecules show promise as therapeutic agents for human disease (Usman & McSwiggen, 
1995 Ann. Rep. Med. Chem. 30, 285-294; Christoffersen and Marr, 1995 J. Med. Chem. 
38, 2023-2037). Enzymatic nucleic acid molecule can be designed to cleave specific 
RNA targets within the background of cellular RNA. Such a cleavage event renders the 
RNA non-functional and abrogates protein expression from that RNA. In this manner, 
30 synthesis of a protein associated with a disease state can be selectively modulated 
(Warashina et al, 1999, ChemisUy and Biology, 6,237-250. 
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The present invention also features nucleic acid sensor molecules or allozymes 
having sensor domains comprising nucleic acid decoys and/or aptamers of the invention. 
Interaction of the nucleic acid sensor molecule's sensor domain with a molecular target, 
such as HTV gp41 or any other suitable HTV target, can activate or inactivate the 
enzymatic nucleic acid domain of the nucleic acid sensor molecule, such that the activity 
of the nucleic acid sensor molecule is modulated in the presence of the target-signaling 
molecule. The nucleic acid sensor molecule can be designed to be active in the presence 
of the target molecule or alternately, can be designed to be inactive in the presence of the 
molecular target. For example, a nucleic acid sensor molecule is designed with a sensor 
domain comprising an aptamer with binding specificity for HTV gp41. In a non-limiting 
example, interaction of the HIV gp41 with the sensor domain of the nucleic acid sensor 
molecule can activate the enzymatic nucleic acid domain of the nucleic acid sensor 
molecule such that the sensor molecule catalyzes a reaction, for example cleavage of HIV 
RNA. In this example, the nucleic acid sensor molecule is activated in the presence of 
HIV gp41, and can be used as a therapeutic to treat HTV infection. Alternately, the 
reaction can comprise cleavage or ligation of a labeled nucleic acid reporter molecule, 
providing a useful diagnostic reagent to detect the presence of HIV in a system. 

Synthesis of Nucleic acid Molecules 

Synthesis of nucleic acids greater than 100 nucleotides in length is difficult using 
automated methods, and the therapeutic cost of such molecules is prohibitive. In this 
invention, small nucleic acid motifs ("small" refers to nucleic acid motifs no more than 
about 100 nucleotides in length, preferably no more than 80 nucleotides in length, and 
most preferably no more than 50 nucleotides in length; e.g., decoy nucleic acid 
molecules, aptamer nucleic acid molecules antisense nucleic acid molecules, enzymatic 
nucleic acid molecules) are preferably used for exogenous delivery. The simple structure 
of these molecules increases the ability of the nucleic acid to invade targeted regions of 
protein and/or RNA structure. Exemplary molecules of the instant invention are 
chemically synthesized, and others can similarly be synthesized. 

Oligonucleotides (e.g., DNA oligonucleotides) are synthesized using protocols 
known in the art, for example as described in Caruthers et al, 1992, Methods in 
Enzymology 211, 3-19, Thompson et al, International PCT Publication No. WO 
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99/54459, Wincott et al, 1995, Nucleic Acids Res. 23, 2677-2684, Wincott et al, 1997, 
Methods Mol. Bio., 74, 59, Brennan et al, 1998, Biotechnol Bioeng., 61, 33-45, and 
Brennan, US patent No. 6,001,311. All of these references are incorporated herein by 
reference. The synthesis of oligonucleotides makes use of common nucleic acid 
5 protecting and coupling groups, such as dimethoxytrityl at the 5'-end, and 
phosphoramidites at the 3'-end. In a non-limiting example, small scale syntheses are 
conducted on a 394 Applied Biosystems, Inc. synthesizer using a 0.2 umol scale protocol 
with a 2.5 min coupling step for 2'-0-methylated nucleotides and a 45 sec coupling step 
for 2'-deoxy nucleotides. Table II outlines the amounts and the contact times of the 
10 reagents used in the synthesis cycle. Alternatively, syntheses at the 0.2 umol scale can be 
performed on a 96-well plate synthesizer, such as the instrument produced by Protogene 
(Palo Alto, CA) with minimal modification to the cycle. A 33-fold excess (60 uL of 0.1 1 
M = 6.6 umol) of 2'-0-methyl phosphoramidite and a 105-fold excess of S-ethyl tetrazole 
(60 uL of 0.25 M = 15 umol) can be used in each coupling cycle of 2'-0-methyl residues 
15 relative to polymer-bound 5'-hydroxyl. A 22-fold excess (40 uL of 0.1 1 M = 4.4 umol) 
of deoxy phosphoramidite and a 70-fold excess of S-ethyl tetrazole (40 uL of 0.25 M = 
10 umol) can be used in each coupling cycle of deoxy residues relative to polymer-bound 
5'-hydroxyl. Average coupling yields on the 394 Applied Biosystems, Inc. synthesizer, 
determined by colorimetric quantitation of the trityl fractions, are typically 97.5-99%. 
20 Other oligonucleotide synthesis reagents for the 394 Applied Biosystems, Inc. synthesizer 
include the following: detritylation solution is 3% TCA in methylene chloride (ABI); 
capping is performed with 16% ^-methyl imidazole in THF (ABI) and 10% acetic 
anhydride/10% 2,6-lutidine in THF (ABI); and oxidation solution is 16.9 niM 12, 49 mM 
pyridine, 9% water in THF (PERSEPTIVE™). Burdick & Jackson Synthesis Grade 
25 acetonitrile is used directly from the reagent bottle. S-Ethyltetrazole solution (0.25 M in 
acetonitrile) is made up from the solid obtained from American International Chemical, 
Inc. Alternately, for the introduction of phosphorothioate linkages, Beaucage reagent 
(3H-l,2-Benzodithiol-3-one 1,1-dioxide, 0.05 M in acetonitrile) is used. 

Deprotection of the DNA-based oligonucleotides is performed as follows: the 
30 polymer-bound trityl-on oligoribonucleotide is transferred to a 4 mL glass screw top vial 
and suspended in a solution of 40% aq. methylamine (1 mL) at 65 °C for 10 min. After 
cooling to -20 °C, the supernatant is removed from the polymer support. The support is 
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washed three times with 1.0 mL of EtOH:MeCN:H20/3:l:l, vortexed and the supernatant 
is then added to the first supernatant. The combined supernatants, containing the 
oligoribonucleotide, are dried to a white powder. 

The method of synthesis used for normal RNA including certain decoy nucleic acid 
5 molecules and enzymatic nucleic acid molecules follows the procedure as described in 
Usman et al t 1987, J. Am. Chem. Soc, 109, 7845; Scaringe et al, 1990, Nucleic Acids 
Res., 18, 5433; and Wincott et al, 1995, Nucleic Acids Res. 23, 2677-2684 Wincott et 
al, 1997, Methods Mol Bio., 14, 59, and makes use of common nucleic acid protecting 
and coupling groups, such as dimethoxytrityl at the 5'-end, and phosphoramidites at the 
10 3'-end. In a non-limiting example, small scale syntheses are conducted on a 394 Applied 
Biosystems, Inc. synthesizer using a 0.2 nmol scale protocol with a 7.5 min coupling step 
for alkylsilyl protected nucleotides and a 2.5 min coupling step for 2'-0-methylated 
nucleotides. Table II outlines the amounts and the contact times of the reagents used in 
the synthesis cycle. Alternatively, syntheses at the 0.2 |imol scale can be done on a 96- 
15 well plate synthesizer, such as the instrument produced by Protogene (Palo Alto, CA) 
with minimal modification to the cycle. A 33-fold excess (60 \iL of 0.11 M = 6.6 nmol) 
of 2'-Omethyl phosphoramidite and a 75-fold excess of S-ethyl tetrazole (60 fiL of 0.25 
M = 15 (xmol) can be used in each coupling cycle of 2'-0-methyl residues relative to 
polymer-bound 5'-hydroxyl. A 66-fold excess (120 |iL of 0.11 M = 13.2 nmol) of 
20 alkylsilyl (ribo) protected phosphoramidite and a 150-fold excess of S-ethyl tetrazole 
(120 nL of 0.25 M = 30 \imo\) can be used in each coupling cycle of ribo residues 
relative to polymer-bound 5'-hydroxyl. Average coupling yields on the 394 Applied 
Biosystems, Inc. synthesizer, determined by colorimetric quantitation of the trityl 
fractions, are typically 97.5-99%. Other oligonucleotide synthesis reagents for the 394 
25 Applied Biosystems, Inc. synthesizer include the following: detritylation solution is 3% 
TCA in methylene chloride (ABI); capping is performed with 16% AT-methyl imidazole in 
THF (ABI) and 10% acetic anhydride/10% 2,6-lutidine in THF (ABI); oxidation solution 
is 16.9 mM I2 ? 49 mM pyridine, 9% water in THF (PERSEPTIVE™). Burdick & 
Jackson Synthesis Grade acetonitrile is used directly from the reagent bottle. S- 
30 Ethyltetrazole solution (0.25 M in acetonitrile) is made up from the solid obtained from 
American International Chemical, Inc. Alternately, for the introduction of 
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phosphorothioate linkages, Beaucage reagent (3H-l,2-Benzodithiol-3-one 1,1- 
dioxide0.05 M in acetonitrile) is used. 

Deprotection of the RNA is performed using either a two-pot or one-pot protocol. 
For the two-pot protocol, the polymer-bound trityl-on oligoribonucleotide is transferred to 
a 4 mL glass screw top vial and suspended in a solution of 40% aq. methylamine (1 mL) 
at 65 °C for 10 min. After cooling to -20 °C, the supernatant is removed from the 
polymer support. The support is washed three times with 1.0 mL of 
EtOH:MeCN:H20/3:l:l, vortexed and the supernatant is then added to the first 
supernatant. The combined supernatants, containing the oligoribonucleotide, are dried to 
a white powder. The base deprotected oligoribonucleotide is resuspended in anhydrous 
TEA/HF/NMP solution (300 uL of a solution of 1.5 mL N-methylpyrrolidinone, 750 uL 
TEA and 1 mL TEA*3HF to provide a 1.4 M HF concentration) and heated to 65 °C. 
After 1 .5 h, the oligomer is quenched with 1 .5 M NH4HCO3. 

Alternatively, for the one-pot protocol, the polymer-bound trityl-on 
oligoribonucleotide is transferred to a 4 mL glass screw top vial and suspended in a 
solution of 33% ethanolic methylamine/DMSO: 1/1 (0.8 mL) at 65 °C for 15 min. The 
vial is brought to r.t. TEA-3HF (0.1 mL) is added and the vial is heated at 65 °C for 15 
min. The sample is cooled at -20 °C and then quenched with 1 .5 M NH4HCO3. 

For purification of the trityl-on oligomers, the quenched NH4HCO3 solution is 
loaded onto a C-18 containing cartridge that had been prewashed with acetonitrile 
followed by 50 mM TEAA. After washing the loaded cartridge with water, the RNA is 
detritylated with 0.5% TFA for 13 min. The cartridge is then washed again with water, 
salt exchanged with 1 M NaCl and washed with water again. The oligonucleotide is then 
eluted with 30% acetonitrile. 

Inactive hammerhead ribozymes or binding attenuated control (BAC) 
oligonucleotides are synthesized by substituting a U for G5 and a U for A14 (numbering 
from Hertel, K. J., et al, 1992, Nucleic Acids Res. . 20, 3252). Similarly, one or more 
nucleotide substitutions can be introduced in other nucleic acid molecules, such as 
aptamers, to inactivate the molecule and such molecules can serve as a negative control. 
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The average stepwise coupling yields are typically >98% (Wincott et al, 1995 
Nucleic Acids Res. 23, 2677-2684). Those of ordinary skill in the art will recognize that 
the scale of synthesis can be adapted to be larger or smaller than the example described 
above including but not limited to 96-well format, all that is important is the ratio of 
5 chemicals used in the reaction. 

Alternatively, the nucleic acid molecules of the present invention can be 
synthesized separately and joined together post-synthetically, for example, by ligation 
(Moore et al, 1992, Science 256, 9923; Draper et al, International PCT publication No. 
WO 93/23569; Shabarova et al, 1991, Nucleic Acids Research 19, 4247; Bellon et al, 
10 1997, Nucleosides & Nucleotides, 16, 951; Bellon et al, 1997, Bioconjugate Chem. 8, 
204). 

The nucleic acid molecules of the present invention can be modified extensively to 
enhance stability by modification with nuclease resistant groups, for example, 2'-amino, 
2'-C-allyl, 2'-flouro, 2'-0-methyl, 2'-H (for a review see Usman and Cedergren, 1992, 
15 TIBS 17, 34; Usman et al, 1994, Nucleic Acids Symp. Ser. 31, 163). Nucleic acid 
molecules of the invention can be purified by gel electrophoresis using general methods 
or can be purified by high pressure liquid chromatography (HPLC; see Wincott et al, 
supra, the totality of which is hereby incorporated herein by reference) and re-suspended 
in water. 

20 Optimizing Activity of the nucleic acid molecule of the invention. 

Chemically synthesizing nucleic acid molecules with modifications (base, sugar 
and/or phosphate) can prevent their degradation by serum ribonucleases, which can 
increase their potency (see e.g., Eckstein et al, International Publication No. WO 
92/07065; Perrault et al, 1990 Nature 344, 565; Pieken et al., 1991, Science 253, 314; 

25 Usman and Cedergren, 1992, Trends in Biochem. Sci. 17, 334; Usman et al, International 
Publication No. WO 93/15187; and Rossi et al, International Publication No. WO 
91/03162; Sproat, US Patent No. 5,334,711; Gold et al, US 6,300,074; and Burgin et al, 
supra; all of which are incorporated by reference herein). All of the above references 
describe various chemical modifications that can be made to the base, phosphate and/or 

30 sugar moieties of the nucleic acid molecules described herein. Modifications that 
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enhance their efficacy in cells, and removal of bases from nucleic acid molecules to 
shorten oligonucleotide synthesis times and reduce chemical requirements are desired. 

There are several examples in the art describing sugar, base and phosphate 
modifications that can be introduced into nucleic acid molecules with significant 

5 enhancement in their nuclease stability and efficacy. For example, oligonucleotides are 
modified to enhance stability and/or enhance biological activity by modification with 
nuclease resistant groups, for example, 2'-amino, 2'-C-allyl, 2*-flouro, 2'-0-methyl, 2'-H, 
nucleotide base modifications (for a review see Usman and Cedergren, 1992, TIBS. 17, 
34; Usman et al, 1994, Nucleic Acids Symp. Ser. 31, 163; Burgin et al, 1996, 

10 Biochemistry, 35, 14090). Sugar modification of nucleic acid molecules have been 
extensively described in the art (see Eckstein et al, International Publication PCT No. 
WO 92/07065; Perrault et al. Nature, 1990, 344, 565-568; Pieken et al. Science, 1991, 
253, 314-317; Usman and Cedergren, Trends in Biochem. Sci. , 1992, 17, 334-339; 
Usman et al. International Publication PCT No. WO 93/15187; Sproat, US Patent No. 

15 5,334,711 and Beigelman et al, 1995, J. Biol. Chem., 270, 25702; Beigelman et al, 
International PCT publication No. WO 97/26270; Beigelman et al, US Patent No. 
5,716,824; Usman et al, US patent No. 5,627,053; Woolf et al, International PCT 
Publication No. WO 98/13526; Thompson et al, USSN 60/082,404 which was filed on 
April 20, 1998; Karpeisky et al, 1998, Tetrahedron Lett., 39, 1131; Eamshaw and Gait, 

20 1998, Biopolymers (Nucleic Acid Sciences), 48, 39-55; Verma and Eckstein, 1998, Annu. 
Rev. Biochem., 67, 99-134; and Burlina et al, 1997, Bioorg. Med. Chem., 5, 1999-2010; 
all of the references are hereby incorporated in their totality by reference herein). Such 
publications describe general methods and strategies to determine the location of 
incorporation of sugar, base and/or phosphate modifications and the like into ribozymes 

25 without modulating catalysis, and are incorporated by reference herein. In view of such 
teachings, similar modifications can be used as described herein to modify the nucleic 
acid molecules of the instant invention. 

While chemical modification of oligonucleotide intemucleotide linkages with 
phosphorothioate, phosphorothioate, and/or 5'-methylphosphonate linkages improves 
30 stability, excessive modifications can cause some toxicity. Therefore, when designing 
nucleic acid molecules, the amount of these intemucleotide linkages should be 
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minimized. The reduction in the concentration of these linkages should lower toxicity, 
resulting in increased efficacy and higher specificity of these molecules. 

Nucleic acid molecules having chemical modifications that maintain or enhance 
activity are provided. Such a nucleic acid is also generally more resistant to nucleases 
than an unmodified nucleic acid. Accordingly, the in vitro and/or in vivo activity should 
not be significantly lowered. In cases in which modulation is the goal, therapeutic nucleic 
acid molecules delivered exogenously should optimally be stable within cells until 
translation of the target RNA has been modulated long enough to reduce the levels of the 
undesirable protein. This period of time varies between hours to days depending upon the 
disease state. Improvements in the chemical synthesis of RNA and DNA (Wincott et al, 
1995 Nucleic Acids Res. 23, 2677; Caruthers et al., 1992, Methods in Enzymology 21 1,3- 
19 (incorporated by reference herein)) have expanded the ability to modify nucleic acid 
molecules by introducing nucleotide modifications to enhance their nuclease stability, as 
described above. 

hi one embodiment, nucleic acid molecules of the invention include one or more 
G-clamp nucleotides. A G-clamp nucleotide is a modified cytosine analog wherein the 
modifications confer the ability to hydrogen bond both Watson-Crick and Hoogsteen 
faces of a complementary guanine within a duplex, see for example Lin and Matteucci, 
1998, J. Am. Chem. Soc, 120, 8531-8532. A single G-clamp analog substation within an 
oligonucleotide can result in substantially enhanced helical thermal stability and 
mismatch discrimination when hybridized to complementary oligonucleotides. The 
inclusion of such nucleotides in nucleic acid molecules of the invention results in both 
enhanced affinity and specificity to nucleic acid targets. In another embodiment, nucleic 
acid molecules of the invention include one or more LNA "locked nucleic acid" 
nucleotides such as a 2', 4'-C mythylene bicyclo nucleotide (see for example Wengel et 
al, International PCT Publication No. WO 00/66604 and WO 99/14226). 

In another embodiment, the invention features conjugates and/or complexes of 
nucleic acid molecules targeting HIV. Such conjugates and/or complexes can be used to 
facilitate delivery of molecules into a biological system, such as a cell. The conjugates 
and complexes provided by the instant invention can impart therapeutic activity by 
transferring therapeutic compounds across cellular membranes, altering the 
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pharmacokinetics, and/or modulating the localization of nucleic acid molecules of the 
invention. The present invention encompasses the design and synthesis of novel 
conjugates and complexes for the delivery of molecules, including, but not limited to, 
small molecules, lipids, phospholipids, nucleosides, nucleotides, nucleic acids, antibodies, 

5 toxins, negatively charged polymers and other polymers, for example proteins, peptides, 
hormones, carbohydrates, polyethylene glycols, or polyamines, across cellular 
membranes. In general, the transporters described are designed to be used either 
individually or as part of a multi-component system, with or without degradable linkers. 
These compounds are expected to improve delivery and/or localization of nucleic acid 

10 molecules of the invention into a number of cell types originating from different tissues, 
in the presence or absence of serum (see Sullenger and Cech, US 5,854,038). Conjugates 
of the molecules described herein can be attached to biologically active molecules via 
linkers that are biodegradable, such as biodegradable nucleic acid linker molecules. 

The term "biodegradable nucleic acid linker molecule" as used herein, refers to a 
15 nucleic acid molecule that is designed as a biodegradable linker to connect one molecule 
to another molecule, for example, a biologically active molecule. The stability of the 
biodegradable nucleic acid linker molecule can be modulated by using various 
combinations of ribonucleotides, deoxyribonucleotides, and chemically modified 
nucleotides, for example, 2'-0-methyl, 2'-fluoro, 2'-amino, 2'-0-amino, 2'-C-allyl, 2*-0- 
20 allyl, and other 2'-modified or base modified nucleotides. The biodegradable nucleic acid 
linker molecule can be a dimer, trimer, tetramer or longer nucleic acid molecule, for 
example, an oligonucleotide of about 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 
18, 19, or 20 nucleotides in length, or can comprise a single nucleotide with a 
phosphorus-based linkage, for example, a phosphoramidate or phosphodiester linkage. 
25 The biodegradable nucleic acid linker molecule can also comprise nucleic acid backbone, 
nucleic acid sugar, or nucleic acid base modifications. 

The term "biodegradable" as used herein, refers to degradation in a biological 
system, for example enzymatic degradation or chemical degradation. 

The term "biologically active molecule" as used herein, refers to compounds or 
30 molecules that are capable of eliciting or modifying a biological response in a system. 
Non-limiting examples of biologically active molecules contemplated by the instant 
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invention include therapeutically active molecules such as antibodies, hormones, 
antivirals, peptides, proteins, chemotherapeutics, small molecules, vitamins, co-factors, 
nucleosides, nucleotides, oligonucleotides, enzymatic nucleic acids, antisense nucleic 
acids, triplex forming oligonucleotides, 2,5-A chimeras, siRNA, dsRNA, allozymes, 
5 aptamers, decoys and analogs thereof. Biologically active molecules of the invention also 
include molecules capable of modulating the pharmacokinetics and/or pharmacodynamics 
of other biologically active molecules, for example, lipids and polymers such as 
polyamines, polyamides, polyethylene glycol and other polyethers. 

The term "phospholipid" as used herein, refers to a hydrophobic molecule 
10 comprising at least one phosphorus group. For example, a phospholipid can comprise a 
phosphorus-containing group and saturated or unsaturated alkyl group, optionally 
substituted with OH, COOH, oxo, amine, or substituted or unsubstituted aryl groups. 

Therapeutic nucleic acid molecules of the invention delivered exogenously 
optimally are stable within cells such that therapeutic activity is achieved. The nucleic 
15 acid molecules can therefore be designed such that they resistant to nucleases and 
function as effective intracellular therapeutic agents. Improvements in the chemical 
synthesis of nucleic acid molecules described in the instant invention and in the art have 
expanded the ability to modify nucleic acid molecules by introducing nucleotide 
modifications to enhance their nuclease stability as described above. 

20 In yet another embodiment, nucleic acid molecules having chemical modifications 

that maintain or enhance enzymatic activity and/or nuclease stability are provided. Such 
nucleic acids are also generally more resistant to nucleases than unmodified nucleic acids. 
Thus, in vitro and/or in vivo the activity should not be significantly lowered. As 
exemplified herein, such nucleic acid molecules are useful in vitro and/or in vivo even if 

25 activity over all is reduced 10 fold (Burgin et aL, 1996, Biochemistry, 35, 14090). 

Use of the nucleic acid-based molecules of the invention will lead to better 
treatment of the disease progression by affording the possibility of combination therapies 
(e.g., multiple nucleic acid molecules targeted to different genes; nucleic acid molecules 
coupled with known small molecule modulators; or intermittent treatment with 
30 combinations of molecules (including different motifs) and/or other chemical or 
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biological molecules. The treatment of patients with nucleic acid molecules may also 
include combinations of different types of nucleic acid molecules. 



In another aspect the nucleic acid molecules comprise a 5' and/or a 3'- cap 
structure. 

By "cap structure" is meant chemical modifications, which have been incorporated 
at either terminus of the oligonucleotide (see, for example, Wincott et al s WO 97/26270, 
incorporated by reference herein). These terminal modifications protect the nucleic acid 
molecule from exonuclease degradation, and may help in delivery and/or localization 
within a cell. The cap may be present at the 5 '-terminus (5 '-cap) or at the 3 '-terminal (3'- 
cap) or may be present on both termini. In non-limiting examples the 5'-cap is selected 
from inverted abasic residue (moiety); 4',5-methylene nucleotide; l-(beta-D- 
erythrofuranosyl) nucleotide, 4 f -thio nucleotide; carbocyclic nucleotide; 1,5- 
anhydrohexitol nucleotide; L-nucleotides; alpha-nucleotides; modified base nucleotide; 
phosphorodithioate linkage; rtreo-pentofiiranosyl nucleotide; acyclic 3',4'-seco 
nucleotide; acyclic 3,4-dihydroxybutyl nucleotide; acyclic 3,5-dihydroxypentyl 
nucleotide, 3'-3 , -inverted nucleotide moiety; 3'-3'-inverted abasic moiety; 3'-2'-inverted 
nucleotide moiety; 3'-2'-inverted abasic moiety; 1,4-butanediol phosphate; 3- 
phosphoramidate; hexylphosphate; aminohexyl phosphate; 3'-phosphate; 3'- 
phosphorothioate; phosphorodithioate; or bridging or non-bridging methylphosphonate 
moiety (for more details, see Wincott et aL, International PCT publication No. WO 
97/26270, incorporated by reference herein). 

In another embodiment, the 3'-cap is selected from 4 , ,5 , -methylene nucleotide; 1- 
(beta-D-erythrofuranosyl) nucleotide; 4 , -thio nucleotide, carbocyclic nucleotide; 5- 
amino-alkyl phosphate; l,3-diamino-2~propyl phosphate; 3-aminopropyl phosphate; 6- 
aminohexyl phosphate; 1,2-aminododecyl phosphate; hydroxypropyl phosphate; 1,5- 
anhydrohexitol nucleotide; L-nucleotide; alpha-nucleotide; modified base nucleotide; 
phosphorodithioate; //?ra?-pentofiiranosyl nucleotide; acyclic 3 , ,4 , -seco nucleotide; 3,4- 
dihydroxybutyl nucleotide; 3,5-dihydroxypentyl nucleotide, 5'-5'-inverted nucleotide 
moiety; S'-S'-inverted abasic moiety; 5 ! -phosphoramidate; 5'-phosphorothioate; 1,4- 
butanediol phosphate; 5'-amino; bridging and/or non-bridging 5'-phosphoramidate, 
phosphorothioate and/or phosphorodithioate, bridging or non bridging 
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methylphosphonate and 5 ? -mercapto moieties (for more details see Beaucage and Iyer, 
1993, Tetrahedron 49, 1925; incorporated by reference herein). 

By the term ?, non-nucleotide M is meant any group or compound which can be 
incorporated into a nucleic acid chain in the place of one or more nucleotide units, 
5 including either sugar and/or phosphate substitutions, and allows the remaining bases to 
exhibit their enzymatic activity. The group or compound is abasic in that it does not 
contain a commonly recognized nucleotide base, such as adenosine, guanine, cytosine, 
uracil or thymine. 

An "alkyl" group refers to a saturated aliphatic hydrocarbon, including straight- 

10 chain, branched-chain, and cyclic alkyl groups. Preferably, the alkyl group has 1 to 12 
carbons. More preferably, it is a lower alkyl of from 1 to 7 carbons, more preferably 1 to 
4 carbons. The alkyl group can be substituted or unsubstituted. When substituted the 
substituted group(s) is preferably, hydroxyl, cyano, alkoxy, =0, =S, N02 or N(CH3)2, 
amino, or SH. The term also includes alkenyl groups that are unsaturated hydrocarbon 

15 groups containing at least one carbon-carbon double bond, including straight-chain, 
branched-chain, and cyclic groups. Preferably, the alkenyl group has 1 to 12 carbons. 
More preferably, it is a lower alkenyl of from 1 to 7 carbons, more preferably 1 to 4 
carbons. The alkenyl group may be substituted or unsubstituted. When substituted the 
substituted group(s) is preferably, hydroxyl, cyano, alkoxy, =0, =S, NO2, halogen, 

20 N(CH3)2> amino, or SH. The term "alkyl" also includes alkynyl groups that have an 
unsaturated hydrocarbon group containing at least one carbon-carbon triple bond, 
including straight-chain, branched-chain, and cyclic groups. Preferably, the alkynyl 
group has 1 to 12 carbons. More preferably, it is a lower alkynyl of from 1 to 7 carbons, 
more preferably 1 to 4 carbons. The alkynyl group may be substituted or unsubstituted. 

25 When substituted the substituted group(s) is preferably, hydroxyl, cyano, alkoxy, =0, =S, 
NO2 or N(CH3)2, amino or SH. 

Such alkyl groups may also include aryl, alkylaryl, carbocyclic aryl, heterocyclic 
aryl, amide and ester groups. An "aryl" group refers to an aromatic group that has at least 
one ring having a conjugated pi electron system and includes carbocyclic aryl, 
30 heterocyclic aryl and biaryl groups, all of which may be optionally substituted. The 
preferred substituent(s) of aryl groups are halogen, trihalomethyl, hydroxyl, SH, OH, 
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cyano, alkoxy, alkyl, alkenyl, alkynyl, and amino groups. An "alkylaryl" group refers to 
an alkyl group (as described above) covalently joined to an aryl group (as described 
above). Carbocyclic aryl groups are groups wherein the ring atoms on the aromatic ring 
are all carbon atoms. The carbon atoms are optionally substituted. Heterocyclic aryl 

5 groups are groups having from 1 to 3 heteroatoms as ring atoms in the aromatic ring and 
the remainder of the ring atoms are carbon atoms. Suitable heteroatoms include oxygen, 
sulfur, and nitrogen, and include furanyl, thienyl, pyridyl, pyrrolyl, N-lower alkyl pyrrolo, 
pyrirnidyl, pyrazinyl, imidazolyl and the like, all optionally substituted. An "amide" 
refers to an -C(0)-NH-R, where R is either alkyl, aryl, alkylaryl or hydrogen. An "ester" 

10 refers to an -C(0)-OR', where R is either alkyl, aryl, alkylaryl or hydrogen. 

By "nucleotide" as used herein is as recognized in the art to include natural bases 
(standard), and modified bases well known in the art. Such bases are generally located at 
the 1' position of a nucleotide sugar moiety. Nucleotides generally comprise a base, 
sugar and a phosphate group. The nucleotides can be unmodified or modified at the sugar, 
15 phosphate and/or base moiety, (also referred to interchangeably as nucleotide analogs, 
modified nucleotides, non-natural nucleotides, non-standard nucleotides and other; see, 
for example, Usman and McSwiggen, supra; Eckstein et al, International PCT 
Publication No. WO 92/07065; Usman et al, International PCT Publication No. WO 
93/15187; Uhlman & Peyman, supra, all are hereby incorporated by reference herein). 
20 There are several examples of modified nucleic acid bases known in the art as 
summarized by Limbach et al, 1994, Nucleic Acids Res. 22, 2183. Some of the non- 
limiting examples of base modifications that can be introduced into nucleic acid 
molecules include, inosine, purine, pyridin-4-one, pyridin-2-one, phenyl, pseudouracil, 2, 
4, 6-trimethoxy benzene, 3-methyl uracil, dihydrouridine, naphthyl, aminophenyl, 
25 5-alkylcytidines (e.g., 5-methylcytidine), 5-alkyluridines (e.g., ribothymidine), 
5-halouridine (e.g., 5-bromouridine) or 6-azapyrimidines or 6-alkylpyrimidines (e.g. 6- 
methyluridine), propyne, and others (Burgin et al, 1996, Biochemistry, 35, 14090; 
Uhlman & Peyman, supra). By "modified bases" in this aspect is meant nucleotide bases 
other than adenine, guanine, cytosine and uracil at 1' position or their equivalents; such 
30 bases may be used at any position, for example, within the catalytic core of a nucleic acid 
decoy molecule and/or in the substrate-binding regions of the nucleic acid molecule. 
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In one embodiment, the invention features modified nucleic acids, for example 
aptamers, siRNA, antisense, and enzymatic nucleic acid moelcules with phosphate 
backbone modifications comprising one or more phosphorothioate, phosphorodithioate, 
methylphosphonate, morpholino, amidate carbamate, carboxymethyl, acetamidate, 

5 polyamide, sulfonate, sulfonamide, sulfamate, formacetal, thioformacetal, and/or 
alkylsilyl, substitutions. For a review of oligonucleotide backbone modifications, see 
Hunziker and Leumann, 1995, Nucleic Acid Analogues: Synthesis and Properties, in 
Modern Synthetic Methods, VCH, 331-417, and Mesmaeker et al, 1994, Novel Backbone 
Replacements for Oligonucleotides, in Carbohydrate Modifications in Antisense 

10 Research, ACS, 24-39. 

By "abasic" is meant sugar moieties lacking a base or having other chemical groups 
in place of a base at the V position, (for more details, see Usman et al, US 5,891,683 and 
Matulic-Adamic et al, US 5,998,203). 

By "unmodified nucleoside" is meant one of the bases adenine, cytosine, guanine, 
1 5 thymine, uracil joined to the T carbon of (3-D-ribo-furanose. 

By "modified nucleoside" is meant any nucleotide base which contains a 
modification in the chemical structure of an unmodified nucleotide base, sugar and/or 
phosphate. 

In connection with 2' -modified nucleotides as described for the present invention, 
20 by "amino" is meant 2'-NH 2 or V-O- NH 2 , which may be modified or unmodified. Such 
modified groups are described, for example, in Eckstein et al, U.S. Patent 5,672,695 and 
Matulic-Adamic et al, WO 98/28317. 

Various modifications to nucleic acid (e.g., aptamer, siRNA, antisense and 
enzymatic nucleic acid) structure can be made to enhance the utility of these molecules. 
25 Such modifications will enhance shelf-life, half-life in vitro, stability, and ease of 
introduction of such oligonucleotides to the target site, e.g., to enhance penetration of 
cellular membranes, and confer the ability to recognize and bind to targeted cells. 
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Methods for the delivery of nucleic acid molecules are described in Akhtar et al, 
1992, Trends Cell Bio., 2, 139; Deliveiy Strategies for Antisense Oligonucleotide 
Therapeutics, ed. Akhtar, 1995, Maurer et al, 1999, Mol Membr. Biol, 16, 129440; 

5 Hofland and Huang, 1999, Handb. Exp. Pharmacol, 137, 165-192; and Lee et al, 2000, 
ACS Symp. Ser., 752, 184-192, all of which are incorporated herein by reference. 
Sullivan et al, PCT WO 94/02595, further describes the general methods for delivery of 
enzymatic nucleic acid molecules. These protocols can be utilized for the delivery of 
virtually any nucleic acid molecule. Nucleic acid molecules can be administered to cells 

10 by a variety of methods known to those of skill in the art, including, but not restricted to, 
encapsulation in liposomes, by iontophoresis, or by incorporation into other vehicles, 
such as hydrogels, cyclodextrins, biodegradable nanocapsules, and bioadhesive 
microspheres, or by proteinaceous vectors (O'Hare and Normand, International PCT 
Publication No. WO 00/53722). Alternatively, the nucleic acid/vehicle combination is 

1 5 locally delivered by direct inj ection or by use of an infusion pump. Direct inj ection of the 
nucleic acid molecules of the invention, whether subcutaneous, intramuscular, or 
intradermal, can take place using standard needle and syringe methodologies, or by 
needle-free technologies such as those described in Conry et al, 1999, Clin. Cancer Res. , 
5, 2330-2337 and Barry et al, International PCT Publication No. WO 99/31262. The 

20 molecules of the instant invention can be used as pharmaceutical agents. Pharmaceutical 
agents prevent, modulate the occurrence, or treat (alleviate a symptom to some extent, 
preferably all of the symptoms) of a disease state in a patient. 

Thus, the invention features a pharmaceutical composition comprising one or more 
nucleic acid(s) of the invention in an acceptable carrier, such as a stabilizer, buffer, and 

25 the like. The negatively charged polynucleotides of the invention can be administered 
{e.g., RNA, DNA or protein) and introduced into a patient by any standard means, with or 
without stabilizers, buffers, and the like, to form a pharmaceutical composition. When it 
is desired to use a liposome delivery mechanism, standard protocols for formation of 
liposomes can be followed. The compositions of the present invention may also be 

30 formulated and used as tablets, capsules or elixirs for oral administration, suppositories 
for rectal administration, sterile solutions, suspensions for injectable administration, and 
the other compositions known in the art. 



49 



WO 03/102131 PCT/US03/12626 

The present invention also includes pharmaceutical^ acceptable formulations of the 
compounds described. These formulations include salts of the above compounds, e.g., 
acid addition salts, for example, salts of hydrochloric, hydrobromic, acetic acid, and 
benzene sulfonic acid. 

5 A pharmacological composition or formulation refers to a composition or 

formulation in a form suitable for administration, e.g., systemic administration, into a cell 
or patient, including for example a human. Suitable forms, in part, depend upon the use 
or the route of entry, for example oral, transdermal, or by injection. Such forms should 
not prevent the composition or formulation from reaching a target cell (i.e., a cell to 

10 which the negatively charged nucleic acid is desirable for delivery). For example, 
pharmacological compositions injected into the blood stream should be soluble. Other 
factors are known in the art, and include considerations such as toxicity and forms that 
prevent the composition or formulation from exerting its effect. 

By "systemic administration" is meant in vivo systemic absorption or accumulation 
15 of drugs in the blood stream followed by distribution throughout the entire body. 
Administration routes which lead to systemic absorption include, without limitation: 
intravenous, subcutaneous, intraperitoneal, inhalation, oral, intrapulmonary and 
intramuscular. Each of these administration routes expose the desired negatively charged 
polymers, e.g., nucleic acids, to an accessible diseased tissue. The rate of entry of a drug 
20 into the circulation has been shown to be a function of molecular weight or size. The use 
of a liposome or other drug carrier comprising the compounds of the instant invention can 
potentially localize the drug, for example, in certain tissue types, such as the tissues of the 
reticular endothelial system (RES). A liposome formulation that can facilitate the 
association of drug with the surface of cells, such as, lymphocytes and macrophages is 
25 also useful. This approach may provide enhanced delivery of the drug to target cells by 
taking advantage of the specificity of macrophage and lymphocyte immune recognition of 
abnormal cells, such as cancer cells. 

By "pharmaceutical^ acceptable formulation" is meant, a composition or 
formulation that allows for the effective distribution of the nucleic acid molecules of the 
30 instant invention in the physical location most suitable for their desired activity. 
Nonlimiting examples of agents suitable for formulation with the nucleic acid molecules 
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of the instant invention include: P-glycoprotein inhibitors (such as Pluronic P85), which 
can enhance entry of drugs into the CNS (Jolliet-Riant and Tillement, 1999, Fundam. 
Clin. Pharmacol, 13, 16-26); biodegradable polymers, such as poly (DL-lactide- 
coglycolide) microspheres for sustained release delivery after intracerebral implantation 
(Emerich, DF et al 9 1999, Cell Transplant, 8, 47-58) (Alkermes, Inc. Cambridge, MA); 
and loaded nanoparticles, such as those made of polybutylcyanoacrylate, which can 
deliver drugs across the blood brain barrier and can alter neuronal uptake mechanisms 
(Prog Neuropsychopharmacol Biol Psychiatry, 23, 941-949, 1999). Other non-limiting 
examples of delivery strategies for the nucleic acid molecules of the instant invention 
include material described in Boado et al, 1998, J. Pharm. ScL, 87, 1308-1315; Tyler et 
aLf l999y FEBSLetL, 421, 280-284; Pardridge et al, 1995, PNAS USA, 92, 5592-5596; 
Boado, 1995, Adv. Drug Delivery Rev., 15, 73-107; Aldrian-Herrada et al., 1998, Nucleic 
Acids Res., 26, 4910-4916; and Tyler et al, 1999, PNAS USA, 96, 7053-7058. 

The invention also features the use of the composition comprising surface-modified 
liposomes containing poly (ethylene glycol) lipids (PEG-modified, or long-circulating 
liposomes or stealth liposomes). These formulations offer a method for increasing the 
accumulation of drugs in target tissues. This class of drug carriers resists opsonization 
and elimination by the mononuclear phagocytic system (MPS or RES), thereby enabling 
longer blood circulation times and enhanced tissue exposure for the encapsulated drug 
(Lasic et al. Chem. Rev. 1995, 95, 2601-2627; Ishiwata et al, Chem. Pharm. Bull. 1995, 
43, 1005-1011). Such liposomes have been shown to accumulate selectively in tumors, 
presumably by extravasation and capture in the neovascularized target tissues (Lasic et 
al, Science 1995, 267, 1275-1276; Oku et a/.,1995, Biochim. Biophys. Acta, 1238, 86- 
90). The long-circulating liposomes enhance the pharmacokinetics and 
pharmacodynamics of DNA and RNA, particularly compared to conventional cationic 
liposomes which are known to accumulate in tissues of the MPS (Liu et al, 1 Biol 
Chem. 1995, 42, 24864-24870; Choi et al, International PCT Publication No. WO 
96/10391; Ansell et al, International PCT Publication No. WO 96/10390; Holland et al, 
International PCT Publication No. WO 96/10392). Long-circulating liposomes are also 
likely to protect drugs from nuclease degradation to a greater extent compared to cationic 
liposomes, based on their ability to avoid accumulation in metabolically aggressive MPS 
tissues such as the liver and spleen. 
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The present invention also includes compositions prepared for storage or 
administration, which include a pharmaceutically effective amount of the desired 
compounds in a pharmaceutically acceptable carrier or diluent. Acceptable carriers or 
diluents for therapeutic use are well known in the pharmaceutical art, and are described, 
for example, in Remington's Pharmaceutical Sciences, Mack Publishing Co. (A.R. 
Gennaro edit. 1985) hereby incorporated by reference herein. For example, preservatives, 
stabilizers, dyes and flavoring agents may be provided. These include sodium benzoate, 
sorbic acid and esters of^-hydroxybenzoic acid. In addition, antioxidants and suspending 
agents may be used. 

A pharmaceutically effective dose is that dose required to prevent, inhibit the 
occurrence of, or treat (alleviate a symptom to some extent, preferably all of the 
symptoms) a disease state. The pharmaceutically effective dose depends on the type of 
disease, the composition used, the route of administration, the type of mammal being 
treated, the physical characteristics of the specific mammal under consideration, 
concurrent medication, and other factors that those skilled in the medical arts will 
recognize. Generally, an amount between 0.1 mg/kg and 100 mg/kg body weight/day of 
active ingredients is administered dependent upon potency of the negatively charged 
polymer. 

The present invention also includes compositions prepared for storage or 
administration that include a pharmaceutically effective amount of the desired compounds 
in a pharmaceutically acceptable carrier or diluent. Acceptable carriers or diluents for 
therapeutic use are well known in the pharmaceutical art, and are described, for example, 
in Remington's Pharmaceutical Sciences, Mack Publishing Co. (A.R. Gennaro edit. 
1985), hereby incorporated by reference herein. For example, preservatives, stabilizers, 
dyes and flavoring agents can be provided. These include sodium benzoate, sorbic acid 
and esters of /?-hydroxybenzoic acid. In addition, antioxidants and suspending agents can 
be used. 

A pharmaceutically effective dose is that dose required to prevent, inhibit the 
occurrence, or treat (alleviate a symptom to some extent, preferably all of the symptoms) 
of a disease state. The pharmaceutically effective dose depends on the type of disease, 
the composition used, the route of administration, the type of mammal being treated, the 
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physical characteristics of the specific mammal under consideration, concurrent 
medication, and other factors that those skilled in the medical arts will recognize. 
Generally, an amount between 0.1 mg/kg and 100 mg/kg body weight/day of active 
ingredients is administered dependent upon potency of the negatively charged polymer. 

5 The nucleic acid molecules of the invention and formulations thereof can be 

administered orally, topically, parenterally, by inhalation or spray, or rectally in dosage 
unit formulations containing conventional non-toxic pharmaceutical^ acceptable carriers, 
adjuvants and/or vehicles. The term parenteral as used herein includes percutaneous, 
subcutaneous, intravascular (e.g., intravenous), intramuscular, or intrathecal injection or 

10 infusion techniques and the like. In addition, there is provided a pharmaceutical 
formulation comprising a nucleic acid molecule of the invention and a pharmaceutical^ 
acceptable carrier. One or more nucleic acid molecules of the invention can be present in 
association with one or more non-toxic pharmaceutical^ acceptable carriers and/or 
diluents and/or adjuvants, and if desired other active ingredients. The pharmaceutical 

1 5 compositions containing nucleic acid molecules of the invention can be in a form suitable 
for oral use, for example, as tablets, troches, lozenges, aqueous or oily suspensions, 
dispersible powders or granules, emulsion, hard or soft capsules, or syrups or elixirs. 

Compositions intended for oral use can be prepared according to any method 
known to the art for the manufacture of pharmaceutical compositions and such 
20 compositions can contain one or more such sweetening agents, flavoring agents, coloring 
agents or preservative agents in order to provide pharmaceutically elegant and palatable 
preparations. Tablets contain the active ingredient in admixture with non-toxic 
pharmaceutically acceptable excipients that are suitable for the manufacture of tablets. 
These excipients can be, for example, inert diluents; such as calcium carbonate, sodium 
25 carbonate, lactose, calcium phosphate or sodium phosphate; granulating and 
disintegrating agents, for example, corn starch, or alginic acid; binding agents, for 
example starch, gelatin or acacia; and lubricating agents, for example magnesium 
stearate, stearic acid or talc. The tablets can be uncoated or they can be coated by known 
techniques. In some cases such coatings can be prepared by known techniques to delay 
30 disintegration and absorption in the gastrointestinal tract and thereby provide a sustained 
action over a longer period. For example, a time delay material such as glyceryl 
monosterate or glyceryl distearate can be employed. 
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Formulations for oral use can also be presented as hard gelatin capsules wherein the 
active ingredient is mixed with an inert solid diluent, for example, calcium carbonate, 
calcium phosphate or kaolin, or as soft gelatin capsules wherein the active ingredient is 
mixed with water or an oil medium, for example peanut oil, liquid paraffin or olive oil 

Aqueous suspensions contain the active materials in admixture with excipients 
suitable for the manufacture of aqueous suspensions. Such excipients are suspending 
agents, for example sodium carboxymethylcellulose, methylcellulose, hydropropyl- 
methylcellulose, sodium alginate, polyvinylpyrrolidone, gum tragacanth and gum acacia; 
dispersing or wetting agents can be a naturally-occurring phosphatide, for example, 
lecithin, or condensation products of an alkylene oxide with fatty acids, for example 
polyoxyethylene stearate, or condensation products of ethylene oxide with long chain 
aliphatic alcohols, for example heptadecaethyleneoxycetanol, or condensation products of 
ethylene oxide with partial esters derived from fatty acids and a hexitol such as 
polyoxyethylene sorbitol monooleate, or condensation products of ethylene oxide with 
partial esters derived from fatty acids and hexitol anhydrides, for example polyethylene 
sorbitan monooleate. The aqueous suspensions can also contain one or more 
preservatives, for example ethyl, or n-propyl p-hydroxybenzoate, one or more coloring 
agents, one or more flavoring agents, and one or more sweetening agents, such as sucrose 
or saccharin. 

Oily suspensions can be formulated by suspending the active ingredients in a 
vegetable oil, for example arachis oil, olive oil, sesame oil or coconut oil, or in a mineral 
oil such as liquid paraffin. The oily suspensions can contain a thickening agent, for 
example beeswax, hard paraffin or cetyl alcohol. Sweetening agents and flavoring agents 
can be added to provide palatable oral preparations. These compositions can be preserved 
by the addition of an anti-oxidant such as ascorbic acid. 

Dispersible powders and granules suitable for preparation of an aqueous suspension 
by the addition of water provide the active ingredient in admixture with a dispersing or 
wetting agent, suspending agent and one or more preservatives. Suitable dispersing or 
wetting agents or suspending agents are exemplified by those already mentioned above. 
Additional excipients, for example sweetening, flavoring and coloring agents, can also be 
present. 
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Pharmaceutical compositions of the invention can also be in the form of oil-in- 
water emulsions. The oily phase can be a vegetable oil or a mineral oil or mixtures of 
these. Suitable emulsifying agents can be naturally-occurring gums, for example gum 
acacia or gum tragacanth, naturally-occurring phosphatides, for example soy bean, 
5 lecithin, and esters or partial esters derived from fatty acids and hexitol, anhydrides, for 
example sorbitan monooleate, and condensation products of the said partial esters with 
ethylene oxide, for example polyoxyethylene sorbitan monooleate. The emulsions can 
also contain sweetening and flavoring agents. 

Syrups and elixirs can be formulated with sweetening agents, for example glycerol, 
10 propylene glycol, sorbitol, glucose or sucrose. Such formulations can also contain a 
demulcent, a preservative and flavoring and coloring agents. The pharmaceutical 
compositions can be in the form of a sterile injectable aqueous or oleaginous suspension. 
This suspension can be formulated according to the known art using those suitable 
dispersing or wetting agents and suspending agents that have been mentioned above. The 
15 sterile injectable preparation can also be a sterile injectable solution or suspension in a 
non-toxic parentally acceptable diluent or solvent, for example as a solution in 1,3- 
butanediol. Among the acceptable vehicles and solvents that can be employed are water, 
Ringer's solution and isotonic sodium chloride solution. In addition, sterile, fixed oils are 
conventionally employed as a solvent or suspending medium. For this purpose, any bland 
20 fixed oil can be employed including synthetic mono-or diglycerides. In addition, fatty 
acids such as oleic acid find use in the preparation of injectables. 

The nucleic acid molecules of the invention can also be administered in the form of 
suppositories, e.g., for rectal administration of the drug. These compositions can be 
prepared by mixing the drug with a suitable non-irritating excipient that is solid at 
25 ordinary temperatures but liquid at the rectal temperature and will therefore melt in the 
rectum to release the drug. Such materials include cocoa butter and polyethylene glycols. 

Nucleic acid molecules of the invention can be administered parenterally in a sterile 
medium. The drug, depending on the vehicle and concentration used, can either be 
suspended or dissolved in the vehicle. Advantageously, adjuvants such as local 
30 anesthetics, preservatives and buffering agents can be dissolved in the vehicle. 
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Dosage levels of the order of from about 0.1 mg to about 140 mg per kilogram of 
body weight per day are useful in the treatment of the above-indicated conditions (about 
0.5 mg to about 7 g per patient per day). The amount of active ingredient that can be 
combined with the carrier materials to produce a single dosage form varies depending 
upon the host treated and the particular mode of administration. Dosage unit forms 
generally contain between from about 1 mg to about 500 mg of an active ingredient. 

It is understood that the specific dose level for any particular patient depends upon a 
variety of factors including the activity of the specific compound employed, the age, body 
weight, general health, sex, diet, time of aoministration, route of administration, and rate 
of excretion, drug combination and the severity of the particular disease undergoing 
therapy. 

For administration to non-human animals, the composition can also be added to the 
animal feed or drinking water. It can be convenient to formulate the animal feed and 
drinking water compositions so that the animal takes in a therapeutically appropriate 
quantity of the composition along with its diet. It can also be convenient to present the 
composition as a premix for addition to the feed or drinking water. 

The nucleic acid molecules of the present invention may also be administered to a 
patient in combination with other therapeutic compounds to increase the overall 
therapeutic effect. The use of multiple compounds to treat an indication may increase the 
beneficial effects while reducing the presence of side effects. 

In one embodiment, the invention compositions suitable for administering nucleic 
acid molecules of the invention to specific cell types, such as hepatocytes. For example, 
the asialoglycoprotein receptor (ASGPr) (Wu and Wu, 1987, J. Biol. Chem. 262, 4429- 
4432) is unique to hepatocytes and binds branched galactose-terminal glycoproteins, such 
as asialoorosomucoid (ASOR). Binding of such glycoproteins or synthetic 
glycoconjugates to the receptor takes place with an affinity that strongly depends on the 
degree of branching of the oligosaccharide chain, for example, triatennary structures are 
bound with greater affinity than biatenarry or monoatennary chains (Baenziger and Fiete, 
1980, Cell, 22, 611-620; Connolly et al, 1982, J. Biol. Chem., 257, 939-945). Lee and 
Lee, 1987, Glycoconjugate J., 4, 317-328, obtained this high specificity through the use 
of N-acetyl-D-galactosamine as the carbohydrate moiety, which has higher affinity for the 
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receptor, compared to galactose. This "clustering effect" has also been described for the 
binding and uptake of mannosyl-terminating glycoproteins or glycoconjugates (Ponpipom 
et al, 1981, J, Med, Chem., 24, 1388-1395). The use of galactose and galactosamine 
based conjugates to transport exogenous compounds across cell membranes can provide a 
5 targeted delivery approach to the treatment of liver disease such as HBV infection or 
hepatocellular carcinoma. The use of bioconjugates can also provide a reduction in the 
required dose of therapeutic compounds required for treatment. Furthermore, therapeutic 
bioavialability, pharmacodynamics, and pharmacokinetic parameters can be modulated 
through the use of nucleic acid bioconjugates of the invention. 

10 Alternatively, certain of the nucleic acid molecules of the instant invention can be 

expressed within cells from eukaryotic promoters (e.g., Izant and Weintraub, 1985, 
Science, 229, 345; McGarry and Lindquist, 1986, Proc. Natl Acad, ScL, USA 83, 399; 
Scanlon et al, 1991, Proc, Natl Acad. Set USA, 88, 10591-5; Kashani-Sabet et al, 1992, 
Antisense Res, Dev., 2, 3-15; Dropulic et al, 1992, J. Virol, 66, 1432-41; Weerasinghe 

15 et al, 1991, J. Virol, 65, 5531-4; Ojwang et al, 1992, Proc. Natl Acad, Set USA, 89, 
10802-6; Chen et al, 1992, Nucleic Acids Res., 20,4581-9; Sarver et al, 1990 Science, 
247, 1222-1225; Thompson et al, 1995, Nucleic Acids Res., 23, 2259; Good et al, 1997, 
Gene Therapy, 4, 45; all of these references are hereby incorporated in their totalities by 
reference herein). Those skilled in the art realize that any nucleic acid can be expressed 

20 in eukaryotic cells from the appropriate DNA/RNA vector. The activity of such nucleic 
acids can be augmented by their release from the primary transcript by a ribozyme 
(Draper et al, PCT WO 93/23569, and Sullivan et al, PCT WO 94/02595; Ohkawa et al, 
1992, Nucleic Acids Symp, Ser., 27, 15-6; Taira et al, 1991, Nucleic Acids Res., 19, 
5125-30; Ventura et al, 1993, Nucleic Acids Res., 21, 3249-55; Chowrira et al, 1994, J. 

25 Biol Chem., 269, 25856; all of these references are hereby incorporated in their totality 
by reference herein). 

In another aspect of the invention, RNA molecules of the present invention are 
preferably expressed from transcription units (see, for example, Couture et al, 1996, 
TIG., 12, 510) inserted into DNA or RNA vectors. The recombinant vectors are 
30 preferably DNA plasmids or viral vectors. Ribozyme expressing viral vectors could be 
constructed based on, but not limited to, adeno-associated virus, retrovirus, adenovirus, or 
alphavirus. Preferably, the recombinant vectors capable of expressing the nucleic acid 
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molecules are delivered as described above, and persist in target cells. Alternatively, 
viral vectors may be used that provide for transient expression of nucleic acid molecules. 
Such vectors might be repeatedly administered as necessary. Once expressed, the nucleic 
acid molecule binds to the target mRNA. Delivery of nucleic acid molecule expressing 
vectors could be systemic, such as by intravenous or intra-muscular administration, by 
administration to target cells ex-planted from the patient followed by reintroduction into 
the patient, or by any other means that would allow for introduction into the desired target 
cell (for a review see Couture et al, 1996, TIG., 12, 510). 

In one aspect, the invention features an expression vector comprising a nucleic acid 
sequence encoding at least one of the nucleic acid molecules of the instant invention is 
disclosed. The nucleic acid sequence encoding the nucleic acid molecule of the instant 
invention is operable linked in a manner which allows expression of that nucleic acid 
molecule. 

In another aspect the invention features an expression vector comprising: a) a 
transcription initiation region (e.g., eukaryotic pol I, II or HI initiation region); b) a 
transcription termination region (e.g., eukaryotic pol I, II or III termination region); c) a 
nucleic acid sequence encoding at least one of the nucleic acid catalyst of the instant 
invention; and wherein said sequence is operably linked to said initiation region and said 
termination region in a manner which allows expression and/or delivery of said nucleic 
acid molecule. The vector can optionally include an open reading frame (ORF) for a 
protein operably linked on the 5' side or the 3'-side of the sequence encoding the nucleic 
acid catalyst of the invention; and/or an intron (intervening sequences). 

Transcription of the nucleic acid molecule sequences are driven from a promoter 
for eukaryotic RNA polymerase I (pol I), RNA polymerase II (pol II), or RNA 
polymerase III (pol III). Transcripts from pol II or pol in promoters will be expressed at 
high levels in all cells; the levels of a given pol II promoter in a given cell type will 
depend on the nature of the gene regulatory sequences (enhancers, silencers, etc.) present 
nearby. Prokaryotic RNA polymerase promoters are also used, providing that the 
prokaryotic RNA polymerase enzyme is expressed in the appropriate cells (Elroy-Stein 
and Moss, 1990, Proc. Natl. Acad. Set USA, 87, 6743-7; Gao and Huang 1993, Nucleic 
Acids Res., 21, 2867-72; Lieber et al., 1993, Methods Enzymol, 217, 47-66; Zhou et al., 
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1990, Mol Cell Biol, 10, 4529-37). All of these references are incorporated by 
reference herein. Several investigators have demonstrated that nucleic acid molecules, 
such as ribozymes expressed from such promoters can function in mammalian cells (e.g. 
Kashani-Sabet et al., 1992, Antisense Res. Dev., 2, 3-15; Ojwang et aL, 1992, Proc. Natl 

5 Acad. Set USA, 89, 10802-6; Chen et al, 1992, Nucleic Acids Res., 20, 4581-9; Yu et 
al, 1993, Proc. Natl Acad. Set USA, 90, 6340-4; L'Huillier et al, 1992, EMBOJ., 11, 
4411-8; Lisziewicz et al, 1993, Proc. Natl Acad. Set U. S. A, 90, 8000-4; Thompson et 
al, 1995, Nucleic Acids Res., 23, 2259; Sullenger & Cech, 1993, Science, 262, 1566). 
More specifically, transcription units such as the ones derived from genes encoding U6 

10 small nuclear (snRNA), transfer RNA (tRNA) and adenovirus VA RNA are useful in 
generating high concentrations of desired RNA molecules such as ribozymes in cells 
(Thompson et al, supra; Couture and Stinchcomb, 1996, supra; Noonberg et al, 1994, 
Nucleic Acid Res., 22, 2830; Noonberg et al, US Patent No. 5,624,803; Good et al, 
1997, Gene Ther., 4, 45; Beigelman et al, International PCT Publication No. WO 

15 96/18736; all of these publications are incorporated by reference herein). The above 
ribozyme transcription units can be incorporated into a variety of vectors for introduction 
into mammalian cells, including but not restricted to, plasmid DNA vectors, viral DNA 
vectors (such as adenovirus or adeno-associated virus vectors), or viral RNA vectors 
(such as retroviral or alphavirus vectors) (for a review see Couture and Stinchcomb, 

20 1996, supra). 

In yet another aspect, the invention features an expression vector comprising 
nucleic acid sequence encoding at least one of the nucleic acid molecules of the invention 
in a manner that allows expression of that nucleic acid molecule. The expression vector 
comprises in one embodiment; a) a transcription initiation region; b) a transcription 

25 termination region; c) a nucleic acid sequence encoding at least one said nucleic acid 
molecule; and wherein said sequence is operably linked to said initiation region and said 
termination region in a manner which allows expression and/or delivery of said nucleic 
acid molecule. In another embodiment, the expression vector comprises: a) a 
transcription initiation region; b) a transcription termination region; c) an open reading 

30 frame; d) a nucleic acid sequence encoding at least one said nucleic acid molecule, 
wherein said sequence is operably linked to the 3'-end of said open reading frame and 
wherein said sequence is operably linked to said initiation region, said open reading frame 
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and said termination region in a manner which allows expression and/or delivery of said 
nucleic acid molecule. In yet another embodiment, the expression vector comprises: a) a 
transcription initiation region; b) a transcription termination region; c) an intron; d) a 
nucleic acid sequence encoding at least one said nucleic acid molecule and wherein said 

5 sequence is operably linked to said initiation region, said intron and said termination 
region in a manner which allows expression and/or delivery of said nucleic acid molecule. 
In another embodiment, the expression vector comprises: a) a transcription initiation 
region; b) a transcription termination region; c) an intron; d) an open reading frame; e) a 
nucleic acid sequence encoding at least one said nucleic acid molecule, wherein said 

10 sequence is operably linked to the 3 '-end of said open reading frame and wherein said 
sequence is operably linked to said initiation region, said intron, said open reading frame 
and said termination region in a manner which allows expression and/or delivery of said 
nucleic acid molecule. 

Examples : 

15 The following are non-limiting examples showing the selection, isolation, synthesis 

and activity of nucleic acids of the instant invention. 

Exam ple 1: Identification of Aptamers that specifically bind to HIV gp41 

A nucleic acid aptamer that selectively binds HIV gp41 is provided in accordance 
with the present invention. The binding affinity of the aptamer for HIV gp41 is 
20 preferably represented by the dissociation constant of about 20 nanomolar (nM) or less, 
and more preferably about 10 nM or less. In one embodiment, the Kd of the aptamer and 
gp41 target is established using a double filter nitrocellulose filter binding assay such as 
that disclosed by Wong and Lohman, 1993, PNAS USA, 90, 5428-5432. 

Generally, the method for isolating aptamers of the invention having specificity for 
25 HIV gp41 comprises: (a) preparing a candidate mixture of potential oligonucleotide 
ligands for gp41 wherein the candidate mixture is complex enough to contain at least one 
oligonucleotide ligand for gp41 or a peptide derivative thereof (the gp41 target); (b) 
contacting the candidate mixture with the gp41 target under conditions suitable for at 
least one oligonucleotide in the candidate mixture to bind to the gp41 target; (c) removing 
30 unbound oligonucleotides from the candidate mixture; (d) collecting the oligonucleotide 
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ligands that are bound to the gp41 target to produce a first collected mixture of 
oligonucleotide ligands; (e) contacting the mixture from (d) with the gp41 target under 
more stringent binding conditions than in (b), wherein oligonucleotide ligands having 
increased affinity to the gp41 target relative to the first collected mixture of (d); (f) 
removing unbound oligonucleotides from (e); and (g) collecting the oligonucleotide 
ligands that are bound to the gp41 target to produce a second collected mixture of 
oligonucleotide ligands to thereby identify oligonucleotides having specificity for HIV 
gp41. The method can comprise additional steps in which the oligonucleotides isolated in 
the first or second collected mixture are enriched or expanded by any suitable technique, 
such as amplification or mutagenesis, prior to contacting the first collected 
oligonucleotide mixture with the target under the higher stringency conditions, after 
collecting the oligonucleotides that bound to the target under the higher stringency 
conditions, or both. Optionally, the contacting and expanding or enriching steps are 
repeated as necessary to produce the desired aptamer. Thus, it is possible that the second 
collected oligonucleotide mixture can comprise a single aptamer. The conditions used to 
affect the stringency of binding used in the method can include varying reaction 
conditions used for binding, for example the composition of a buffer, temperature, time, 
and concentration of the components used for binding can be optimized for the desired 
level of stringency. 

In vitro Selection 

hi a non-limiting example, aptamers having binding specificity for a HIV-1 gp41 
target are isolated by applying the method under the following conditions. First, the gp41 
target is attached to a solid matrix such as a bead or chip surface by means of a covalent 
(eg. amide or morpholino bond) or non-covalent (eg. biotin/streptavidin) linkage. The 
gp41 target can comprise the entire isolated gp41 subunit of HIV envelope glycoprotein 
or an isolated peptide sequence derived therefrom, such as a peptide having SEQ ID NOs. 
1233 and/or 1234. The isolated peptide sequence can be synthesized or isolated by 
protein digest. 

A random pool of DNA oligomers is synthesized where the 5* and 3' proximal ends 
are fixed sequences used for amplification and the central region consists of randomized 
positions. Ten picomoles of template are PCR amplified for 8 cycles in the initial round. 
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Copy DNA of the selected pool of RNA from subsequent rounds of amplification are 
PCR amplified 18 cycles. PCR reactions are carried out in a 50 .mu.l volume containing 
200 picomoles of each primer, 2 mM final concentration dNTP's, 5 units of Thermus 
aquaticus DNA polymerase (Perkin Elmer Cetus) in a PCR buffer (10 mM Tris-Cl pH 
8.4, 50 mM KC1, 7.5 mM MgCl.sub.2, 0.05 mg/ml BSA). Primers are annealed at 
58.degree. C. for 20 seconds and extended at 74.degree. C. for 2 minutes. Denaturation 
can occur at 93°C for 30 seconds. 

Products from PCR amplification are used for T7 in vitro transcription in a 200 
.mu.l reaction volume. T7 transcripts, are purified from an 8 percent, 7M Urea 
polyacrylamide gel and eluted by crushing gel pieces in a Sodium Acetate/EDTA 
solution. For each round of amplification, 50 picomoles of the selected pool of RNA is 
phosphatased for 30 minutes using Calf Intestinal Alkaline Phosphatase. The reaction is 
then phenol extracted 3 times and chloroform extracted once, then ethanol precipitated. 
25 picomoles of this RNA is 5' end-labeled using .gamma 32 P ATP with T4 
polynucleotide kinase for 30 minutes. Kinased RNA is gel purified and a small quantity 
(about 150 ftnoles; 100,000 cpm) is used along with 250 picomoles of cold RNA to 
follow the fraction of RNA bound to gp41 and retained on nitrocellulose filters during the 
separation step of the method. Typically a protein concentration is used that binds one to 
five percent of the total input RNA. A control (without protein) is used to determine the 
background which is typically 0.1% of the total input. Selected RNA is eluted from the 
filter by extracting three times with water saturated phenol containing 2% lauryl sulfate 
(SDS), 0.3M NaOAc and 5 mM EDTA followed by a chloroform extraction. Twenty five 
percent of this RNA is then used to synthesize cDNA for PCR amplification. 

Selection with Non-Amplifiable Competitor RNA 

In a non-limiting example, selections are performed using two buffer conditions 
where the only difference between the buffers is sodium concentration (250 mM NaCl or 
500 mM NaCl). Two different buffer conditions are used to increase stringency (with the 
higher salt concentration being more stringent) and to determine whether different ligands 
can be obtained. After 10 rounds of amplification, the binding constant of the selected 
pool can decrease by about an order of magnitude and can remain constant for the next 
two additional rounds. Competitor RNA is not used in the first 12 rounds. After this 
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round, the pool is split and selection carried out in the presence and absence (control) of 
competitor RNA. For rounds 12 through 18, a 50-fold excess of a non-amplifiable 
random pool of RNA is present during selection to compete with non-specific low- 
affinity binders that may survive and thus be amplified. The competitor RNA, which had 
a 30N random region, is made as described above for the amplifiable pool RNA; 
however, the competitor RNA has different primer annealing sequences. Thus, the 
competitor RNA does not survive the cDNA synthesis or PCR amplification steps. It 
would be apparent to one skilled in the art that other primer sequences could be used as 
long as they are not homologous to those used for the pool RNA. The use of competitor 
RNA can increase the affinity of the selected pool by several orders of magnitude. 

Cloning and Sequencing 

In a non-limiting example, PCR amplified DNA from the last round selected-pool 
of RNA is phenol and chloroform extracted and ethanol precipitated. The extracted PCR 
DNA is then digested using Bam HI and Hind HI restriction enzymes and sub-cloned into 
pUC18. DNAs are phenol and chloroform extracted following digestion. Ligation is 
carried out at room temperature for two hours after which time the reaction is phenol and 
chloroform extracted and used to electroporate competent cells. Fifty transformants from 
the selections using competitor RNA at both NaCl concentrations are picked and their 
DNAs sequenced. 

Binding Assays 

In a non-limiting example, binding assays are performed by adding 5 .mu.l of HIV- 

1 gp41 protein, at the appropriate concentrations (i.e., ranging from 2 x 10" 6 with 3 fold 
dilutions to 9 x 10" 9 for 250 mM NaCl and 0.5 x 10" 7 with 3 fold dilutions to 2 x lO" 10 
for 50 mM NaCl), to 45 ul of binding buffer (50 mM Na-HEPES pH 7.5, 250 mM NaCl, 

2 mM DTT, 10 mM MnCl2, 5 mM CHAPS) on ice, then adding 50,000 cpm of kinased 
RNA (<200 finoles) in a volume of 3 to 4 .mu.l. This mix is incubated at 37°C for 20 
minutes. The reactions are then passed over nitrocellulose filters, which are pre- 
equilibrated in buffer, and washed with a 50 mM Tris-Cl pH 7.5 solution. Filters are dried 
and counted. 
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General Considerations in Aptamer Selection 

When a consensus sequence is identified, oligonucleotides that contain that 
sequence can be made by conventional synthetic or recombinant techniques. These 
aptamers can also function as target-specific aptamers of this invention. Such an aptamer 

5 can conserve the entire nucleotide sequence of an isolated aptamer, or can contain one or 
more additions, deletions or substitutions in the nucleotide sequence, as long as a 
consensus sequence is conserved. A mixture of such aptamers can also function as target- 
specific aptamers, wherein the mixture is a set of aptamers with a portion or portions of 
their nucleotide sequence being random or varying, and a conserved region that contains 

10 the consensus sequence. Additionally, secondary aptamers can be synthesized using one 
or more of the modified bases, sugars and linkages described herein using conventional 
techniques and those described herein. 

In some embodiments of this invention, aptamers can be sequenced or mutagenized 
to identify consensus regions or domains that are participating in aptamer binding to 

15 target, and/or aptamer structure. This information is used for generating second and 
subsequent pools of aptamers of partially known or predetermined sequence. Sequencing 
used alone or in combination with the retention and selection processes of this invention, 
can be used to generate less diverse oligonucleotide pools from which aptamers can be 
made. Further selection according to these methods can be carried out to generate 

20 aptamers having preferred characteristics for diagnostic or therapeutic applications. That 
is, domains that facilitate, for example, drug delivery could be engineered into the 
aptamers selected according to this invention. 

Although this invention is directed to making aptamers using screening from pools 
of non-predetermined sequences of oligonucleotides, it also can be used to make second- 

25 generation aptamers from pools of known or partially known sequences of 
oligonucleotides. A pool is considered diverse even if one or both ends of the 
oligonucleotides comprising it are not identical from one oligonucleotide pool member to 
another, or if one or both ends of the oligonucleotides comprising the pool are identical 
with non-identical intermediate regions from one pool member to another. Toward this 

30 objective, knowledge of the structure and organization of the target protein can be useful 
to distinguish features that are important for biochemical pathway inhibition or biological 
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response generation in the first generation aptamers. Structural features can be considered 
in generating a second (less random) pool of oligonucleotides for generating second 
round aptamers: 

Those skilled in the art will appreciate that comparisons of the complete or partial 
amino acid sequences of the purified protein target to identify variable and conserved 
regions is useful. Comparison of sequences of aptamers made according to this invention 
provides information about the consensus regions and consensus sequences responsible 
for binding. It is expected that certain nucleotides will be rigidly specified and certain 
positions will exclusively require certain bases. Likewise, studying localized regions of a 
protein to identify secondary structure can be useful. Localized regions of a protein can 
adopt a number of different conformations including beta strands, alpha helices, turns 
(induced principally by proline or glycine residues) or random structure. Different regions 
of a polypeptide interact with each other through hydrophobic and electrostatic 
interactions and also by formation of salt bridges, disulfide bridges, etc. to form the 
secondary and tertiary structures. Defined conformations can be formed within the protein 
organization, including beta sheets, beta barrels, and clusters of alpha helices. 

It sometimes is possible to determine the shape of a protein target or portion thereof 
by crystallography X ray diffraction or by other physical or chemical techniques known 
to those skilled in the art. Many different computer programs are available for predicting 
protein secondary and tertiary structure, the most common being those described in Chou 
and Fasman, 1978, Biochemistry, 13, 222-245, and Gamier et al, 1978, J. Mol BioL, 120, 
97-120. Generally, these and other available programs are based on the physical and 
chemical properties of individual amino acids (hydrophobicity, size, charge and presence 
of side chains) and on the amino acids' collective tendency to form identifiable structures 
in proteins whose secondary structure has been determined. Many programs attempt to 
weight structural data with their known influences. For example, amino acids such as 
proline or glycine are often present where polypeptides have share turns. Long stretches 
of hydrophobic amino acids (as determined by hydropathy plot), usually have a strong 
affinity for lipids. 

Data obtained by the methods described above and by other conventional methods 
and tools can be correlated with the presence of particular sequences of nucleotides in the 
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first and second generation aptamers to engineer second and third generation aptamers. 
Further, according to this invention, second generation aptamers can be identified simply 
by sequentially screening from pools of oligonucleotides having more predetermined 
sequences than the pools used in earlier rounds of selection. 

5 These methods can be used to design optimal binding sequences for any desired 

protein target (which can be portions of aptamers or entire aptamers) and/or to engineer 
into aptamers any number of desired targeted functions or features. Optimal binding 
sequences are those which exhibit high relative affinity for target, i.e., affinity measured 
in Kd in at least in the nanomolar range, and, for certain drug applications, the nanomolar 

10 or picomolar range. In practicing this invention, studying the binding energies of 
aptamers using standard methods known generally in the art are useful. Generally, 
consensus regions can be identified by comparing the conservation of nucleotides for 
appreciable enhancement in binding. 

Structural knowledge can be used to engineer aptamers made according to this 
15 invention. For example, stem structures in the aptamer pool can be vital components in 
some embodiments where increased aptamer rigidity is desired. According to the 
teachings of the instant invention, a randomly generated pool of oligonucleotides having 
the stem sequences can be generated. After aptamers are identified which contain the 
stem (i.e., use the stem in primers), cross-linkers can be introduced into the stem to 
20 covalently fix the stem in the aptamer structure. Cross-linkers also can be used to fix an 
aptamer to a target. Once an aptamer has been identified, it can be used, either by linkage 
to, or use in combination with, other aptamers identified according to these methods. One 
or more aptamers can be used in this manner to bind to one or more targets. 

Techniques used in optimizing Aptamer binding 

25 In order to produce nucleic acid aptamers desirable for use as a pharmaceutical 

composition, it is desirable that the nucleic acid aptamer have the following 
characteristics: (1) the nucleic acid aptamer binds to the target in a manner capable of 
achieving the desired effect on the target; (2) be as small as possible to obtain the desired 
effect; (3) be as stable as possible; and (4) be a specific ligand to the chosen target. In 

30 most, if not all, situations it is preferred that the nucleic acid ligand has the highest 
possible affinity to the target. Modifications or derivatizations of the ligand that confer 
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resistance to degradation and clearance in situ during therapy, the capability to cross 
various tissue or cell membrane barriers, or any other accessory properties that do not 
significantly interfere with affinity for the target molecule can also be provided as 
improvements. 

One of the uses of nucleic acid ligands derived by in vitro selection or another 
approach is to find ligands that alter target molecule function. Thus, it is a good procedure 
to first assay for inhibition or enhancement of function of the target protein. One could 
even perform such functional tests of the combined ligand pool prior to cloning and 
sequencing. Assays for the biological function of the chosen target are generally available 
and known to those skilled in the art, and can be easily performed in the presence of the 
nucleic acid ligand to determine if inhibition occurs. 

Enrichment can supply a number of cloned ligands of probable variable affinity for 
the target molecule. Sequence comparisons can yield consensus secondary structures and 
primary sequences that allow grouping of the ligand sequences into motifs. Although a 
single ligand sequence (with some mutations) can be found frequently in the total 
population of cloned sequences, the degree of representation of a single ligand sequence 
in the cloned population of ligand sequences cannot absolutely correlate with affinity for 
the target molecule. Therefore mere abundance is not the sole criterion for judging 
"winners" after in vitro selection and binding assays for various ligand sequences 
(adequately defining each motif that is discovered by sequence analysis) are required to 
weigh the significance of the consensus arrived at by sequence comparisons. The 
combination of sequence comparison and affinity assays should guide the selection of 
candidates for more extensive ligand characterization. 

An important step for determining the length of sequence relevant to specific 
affinity is to establish the boundaries of that information within a ligand sequence. This is 
conveniently accomplished by selecting end-labeled fragments from hydrolyzed pools of 
the ligand of interest so that 5' and 3' boundaries of the information can be discovered. To 
determine a 3' boundary, one can perform a large-scale in vitro transcription of the 
amplified aptamer sequence, gel purify the RNA using UV shadowing on an intensifying 
screen, phosphatasing the purified RNA, phenol extracting extensively, labeling by kinase 
reactions with 32P, and gel purification of the labeled product (for example by using a 
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film of the gel as a guide). The resultant product can then be subjected to pilot partial 
digestions with RNase Tl (varying enzyme concentration and time, at 50°C in a buffer of 
7M urea, 50 mM sodium citrate pH 5.2) and alkaline hydrolysis (at 50 mM NaC03, 
adjusted to pH 9.0 by prior mixing of 1 M bicarbonate and carbonate solutions; test over 

5 ranges of 20 to 60 minutes at 95°C). Once optimal conditions for alkaline hydrolysis are 
established (so that there is an even distribution of small to larger fragments) one can 
scale up to provide enough material for selection by the target (for example on 
nitrocellulose filters). Binding assays can the be set up, which vary target protein 
concentration from the lowest saturating protein concentration to that protein 

10 concentration at which approximately 10% of RNA is bound as determined by the 
binding assays for the ligand. One can vary target concentration by increasing volume 
rather than decreasing the absolute amount of target; this provides a good signal to noise 
ratio as the amount of RNA bound to the filter is limited by the absolute amount of target. 
The RNA is eluted as, for example, in in vitro selection and then run on a denaturing gel 

15 with Tl partial digests so that the positions of hydrolysis bands can be related to the 
ligand sequence. 

The 5' boundary can be similarly determined. Large-scale in vitro transcriptions are 
purified as described herein. There are two methods for labeling the 3' end of the RNA. 
One method is to kinase Cp with 32P (or purchase 32P-Cp) and ligate to the purified 

20 RNA with RNA ligase. The labeled RNA is then purified and subjected to very identical 
protocols. An alternative is to subject unlabeled RNAs to partial alkaline hydrolyses and 
extend an annealed, labeled primer with reverse transcriptase as the assay for band 
positions. One of the advantages over pCp labeling is the ease of the procedure, the more 
complete sequencing ladder (by dideoxy chain termination sequencing) with which one 

25 can correlate the boundary, and increased yield of assayable product. A disadvantage is 
that the extension on eluted RNA sometimes contains artifactual stops, so it can be 
important to control by spotting and eluting starting material on nitrocellulose filters 
without washes and assaying as the input RNA. Using techniques as described herein, it 
is possible to find the boundaries of the sequence information required for high affinity 

30 binding to the target. 
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Assessment of Nucleotide Contributions to Aptamer Target Binding Affinity 



Once a minimal high affinity ligand sequence is identified, the sequence can be 
used to identify the nucleotides within the boundaries that are critical to the interaction 
with the target molecule. One method is to create a new random template in which all of 

5 the nucleotides of a high affinity ligand sequence are partially randomized or blocks of 
randomness are interspersed with blocks of complete randomness for use in an in vitro 
selection method for example, preferably a modified in vitro selection method as 
disclosed herein. Such "secondary" in vitro selections produce a pool of ligand sequences 
in which critical nucleotides or structures are absolutely conserved, less critical features 

10 preferred, and unimportant positions unbiased. Secondary in vitro selections can thus help 
to further elaborate a consensus that is based on relatively few ligand sequences. In 
addition, even higher-affinity ligands can be provided whose sequences were unexplored 
in the original in vitro selection. 

Another method is to test oligo-transcribed variants (i.e. nucleotide substitution) 
15 where the consensus sequence can be unclear. An additional useful set of techniques are 
inclusively described as chemical modification experiments. Such experiments can be 
used to probe the native structure of RNAs, by comparing modification patterns of 
denatured and non-denatured states. The chemical modification pattern of an RNA ligand 
that is subsequently bound by target molecule can be different from the native pattern, 
20 indicating potential changes in structure upon binding or protection of groups by the 
target molecule. In addition, RNA ligands can fail to be bound by the target molecule 
when modified at positions critical to either the bound structure of the ligand or critical to 
interaction with the target molecule. Such experiments in which these positions are 
identified are described as "chemical modification interference" experiments. 

25 There are a variety of available reagents to conduct such experiments that are 

known to those skilled in the art (see for example, Ehresmann et al t 1987, Nuc. Acids. 
Res., 15, 9109-9128. Chemicals that modify bases can be used to modify ligand RNAs. A 
pool is bound to the target at varying concentrations and the bound RNAs recovered (such 
as in the boundary experiments) and the eluted RNAs analyzed for the modification. An 

30 assay can be by subsequent modification-dependent base removal and aniline scission at 
the baseless position or by reverse transcription assay of sensitive (modified) positions. In 
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such assays, bands (indicating modified bases) in unselected RNAs, appear that disappear 
relative to other bands in target protein-selected RNAs. Similar chemical modifications 
with ethyl nitrosourea, or via mixed chemical or enzymatic synthesis with, for example, 
2-methoxys on ribose or phosphorothioates can be used to identify essential atomic 
groups on the oligonucleotide backbone. In experiments with 2'-methoxy versus 2'-OH 
mixtures, the presence of an essential OH group can result in enhanced hydrolysis relative 
to other positions in molecules that have been stringently selected by the target. 

Comparisons of the intensity of bands for bound and unbound ligands can reveal 
not only modifications that interfere with binding, but also modifications that enhance 
binding. A ligand can be made with precisely that modification and tested for the 
enhanced affinity. Thus chemical modification experiments can be a method for exploring 
additional local contacts with the target molecule, just as walking experiments (see 
below) are for additional nucleotide level contacts with adjacent domains. 

A consensus of primary and secondary structures that enables the chemical or 
enzymatic synthesis of oligonucleotide ligands whose design is based on that consensus is 
provided herein via an in vitro selection method, preferably a modified in vitro selection 
method as disclosed herein. Because the replication machinery of in vitro selection 
requires that rather limited variation at the subunit level (ribonucleotides, for example), 
such ligands imperfectly fill the available atomic space of a target molecule's binding 
surface. However, these ligands can be thought of as high-affinity scaffolds that can be 
derivatized to make additional contacts with the target molecule. In addition, the 
consensus contains atomic group descriptors that are pertinent to binding and atomic 
group descriptors that are coincidental to the pertinent atomic group interactions. Such 
derivatization does not exclude incorporation of cross-linking agents that will give 
specifically directly covalent linkages to the target protein. Such derivatization analyses 
can be performed at but are not limited to the 2' position of the ribose, and thus can also 
include derivatization at any position in the base or backbone of the nucleotide ligand. 

The present invention thus includes nucleic acid ligands wherein certain chemical 
modifications have been made in order to increase the in vivo stability of the ligand or to 
enhance or mediate the delivery of the ligand. Examples of such modifications include 
chemical substitutions at the ribose and/or phosphate positions of a given RNA sequence. 
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A logical extension of this analysis is a situation in which one or a few nucleotides of the 
polymeric ligand are used as a site for chemical derivative exploration. The rest of the 
ligand serves to anchor in place this monomer (or monomers) on which a variety of 
derivatives are tested for non-interference with binding and for enhanced affinity. Such 

5 explorations can result in small molecules that mimic the structure of the initial ligand 
framework, and have significant and specific affinity for the target molecule independent 
of that nucleic acid framework. Such derivatized subunits, which can have advantages 
with respect to mass production, therapeutic routes of administration, delivery, clearance 
or degradation than the initial ligand, can become the therapeutic and can retain very little 

10 of the original ligand. Thus, the aptamer ligands of the present invention can allow 
directed chemical exploration of a defined site on the target molecule known to be 
important for the target function. 

Walking Experiments 

After a minimal consensus ligand sequence has been determined for a given target, 
15 it is possible to add random sequence to the minimal consensus ligand sequence and 
evolve additional contacts with the target, perhaps to separate but adjacent domains. This 
procedure has been referred to in the art as "walking". A walking experiment can involve 
two experiments performed sequentially. A new candidate mixture is produced in which 
each of the members of the candidate mixture has a fixed nucleic acid region that 
20 corresponds to a nucleic acid ligand of interest. Each member of the candidate mixture 
also contains a randomized region of sequences. According to this method it is possible to 
identify what are referred to as "extended" nucleic acid ligands, which contain regions 
that can bind to more than one binding domain of a target. 

Covariance Analysis 

25 In conjunction with empirical methods for determining the three dimensional 

structure of nucleic acids, computer modeling methods for determining structure of 
nucleic acid ligands can also be employed. Secondary structure prediction is a useful 
guide to correct sequence alignment. It is also a highly useful stepping-stone to correct 
3D structure prediction, by constraining a number of bases into A-form helical geometry. 
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Tables of energy parameters for calculating the stability of secondary structures 
exist. Although early secondary structure prediction programs attempted to simply 
maximize the number of base-pairs formed by a sequence, most current programs seek to 
find structures with minimal free energy as calculated by these thermodynamic 

5 parameters. There are two problems in this approach that should be borne in mind. First, 
the thermodynamic rales are inherently inaccurate, typically to 10% or so, and there are 
many different possible structures lying within 10% of the global energy minimum. 
Second, the actual secondary structure need not lie at a global energy minimum, 
depending on the kinetics of folding and synthesis of the sequence. Nonetheless, for short 

10 sequences, these caveats are of minor importance because there are so few possible 
structures that can form. 

The brute force predictive method is a dot-plot: make an N by N plot of the 
sequence against itself, and mark an X everywhere a base pair is possible. Diagonal runs 
of X's mark the location of possible helices. Exhaustive tree-searching methods can then 

15 search for all possible arrangements of compatible (i.e., non-overlapping) helices of 
length L or more; energy calculations can be done for these structures to rank them as 
more or less likely. The advantages of this method are that all possible topologies, 
including pseudoknotted conformations, can be examined, and that a number of 
suboptimal structures are automatically generated as well. An elegant predictive method, 

20 and currently the most used, is the Zuker program. Zuker, 1989, Science, 244, 48-52. 
Originally based on an algorithm developed by Ruth Nussinov, the Zuker program makes 
a major simplifying assumption that no pseudoknotted conformations will be allowed. 
This permits the use of a dynamic programming approach that runs in time proportional 
to only N3 to N4, where N is the length of the sequence. The Zuker program is the only 

25 program capable of rigorously dealing with sequences of than a few hundred nucleotides, 
so it has come to be the most commonly used by biologists. However, the inability of the 
Zuker program to predict pseudoknotted conformations is a serious consideration. Where 
pseudoknotted RNA structures are suspected or are recognized by eye, a brute-force 
method capable of predicting pseudoknotted conformations should be employed. 

30 A central element of comparative sequence analysis is sequence covariations. A 

covariation is when the identity of one position depends on the identity of another 
position; for instance, a required Watson-Crick base pair shows strong covariation in that 
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knowledge of one of the two positions gives absolute knowledge of the identity at the 
other position. Covariation analysis has been used previously to predict the secondary 
structure of RNAs for which a number of related sequences sharing a common structure 
exist, such as tRNA, rRNAs, and group I introns. It is now apparent that covariation 
analysis can be used to detect tertiary contacts as well. Stormo and Gutell, 1992, Nucleic 
Acids Research, 29, 5785-5795 have designed and implemented an algorithm that 
precisely measures the amount of covariations between two positions in an aligned 
sequence set. The program is called "MIXY" -Mutual Information at position X and Y. 
Consider an aligned sequence set. In each column or position, the frequency of 
occurrence of A, C, G, U, and gaps is calculated. This frequency is called f(bx), the 
frequency of base b in column x. Considering two columns at once, the frequency that a 
given base b appears in column x is f(bx) and the frequency that a given base b appears in 
column y is f(by). If position x and position y do not care about each other's identity-that 
is, the positions are independent; there is no covariation-the frequency of observing bases 
bx and by at position x and y in any given sequence should be just f(bxby)=f(bx)f(by). If 
there are substantial deviations of the observed frequencies of pairs from their expected 
frequencies, the positions are said to covary. 

The amount of deviation from expectation can be quantified with an information 
measure M(x,y), the mutual information of x and y. 

f(b x b v ) 

M(x,y) = S f(b x b y )In 

b x b y y f(bx)f(b y ) 

M(x,y) can be described as the number of bits of information one learns about the 
identity of position y from knowing just the identity of position x. If there is no 
covariation, M(x,y) is zero; larger values of M(x,y) indicate strong covariation. 
Covariation values can be used to develop three-dimensional structural predictions. 

In some ways, the problem is similar to that of structure determination by NMR. 
Unlike crystallography, which in the end yields an actual electron density map, NMR 
yields a set of interatomic distances. Depending on the number of interatomic distances 
one can get, there can be one, few, or many 3D structures with which they are consistent. 
Mathematical techniques had to be developed to transform a matrix of interatomic 



73 



WO 03/102131 PCT/US03/12626 

distances into a structure in 3D space. The two main techniques in use are distance 
geometry and restrained molecular dynamics. 

Distance geometry is the more formal and purely mathematical technique. The 
interatomic distances are considered to be coordinates in an N-dimensional space, where 

5 N is the number of atoms. In other words, the "position" of an atom is specified by N 
distances to all the other atoms, instead of the three (x,y,z) coordinates typically 
considered. Interatomic distances between every atom are recorded in an N-by-N distance 
matrix. A complete and precise distance matrix is easily transformed into a 3 by N 
Cartesian coordinates, using matrix algebra operations. The trick of distance geometry as 

10 applied to NMR is dealing with incomplete (only some of the interatomic distances are 
known) and imprecise data (distances are known to a precision of only a few angstroms at 
best). Much of the time of distance geometry-based structure calculation is thus spent in 
pre-processing the distance matrix, calculating bounds for the unknown distance values 
based on the known ones, and narrowing the bounds on the known ones. Usually, 

15 multiple structures are extracted from the distance matrix that are consistent with a set of 
NMR data; if they all overlap nicely, the data were sufficient to determine a unique 
structure. Unlike NMR structure determination, covariance gives only imprecise distance 
values; but also only probabilistic rather than absolute knowledge about whether a given 
distance constraint should be applied. 

20 Restrained molecular dynamics can also be employed, albeit in a more ad hoc 

manner. Given an empirical force field that attempts to describe the forces that all the 
atoms feel (van der Waals, covalent bonding lengths and angles, electrostatics), one can 
simulate a number of femtosecond time steps of a molecule's motion, by assigning every 
atom at a random velocity (from the Boltzmann distribution at a given temperature) and 

25 calculating each atom's motion for a femtosecond using Newtonian dynamical equations; 
that is "molecular dynamics". In restrained molecular dynamics, one assigns extra ad hoc 
forces to the atoms when they violate specified distance bounds. 

With respect to RNA aptamers, the probabilistic nature of data with restrained 
molecular dynamics can be addressed. The covariation values can be transformed into 
30 artificial restraining forces between certain atoms for certain distance bounds; varying the 
magnitude of the force according to the magnitude of the covariance. NMR and 
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covariance analysis generates distance restraints between atoms or positions, which are 
readily transformed into structures through distance geometry or restrained molecular 
dynamics. Another source of experimental data which can be utilized to determine the 
three dimensional structures of nucleic acids is chemical and enzymatic protection 
5 experiments, which generate solvent accessibility restraints for individual atoms or 
positions. 

Example 2: Nucleic Acid Molecules for Modulating HIV env Gene Expression 

The following examples demonstrate the selection and design of Enzymatic Nucleic 
Acid (hammerhead, DNAzyme, NCH, Amberzyme, Zinzyme, or G-Cleaver), Antisense, 
10 and siRNA molecules and binding/cleavage sites within HIV RNA. 

Identification of Potential Target Sites in Human HIV RNA 

The sequence of human HIV genes are screened for accessible sites using a 
computer-folding algorithm. Regions of the RNA that do not form secondary folding 
structures and contained potential enzymatic nucleic acid molecule and/or antisense 
15 binding/cleavage sites are identified. The sequences of these binding/cleavage sites are 
shown in Tables III to XI. 

Example 2: Selection of Enzymatic Nucleic Acid Cleavage Sites in Human HIV RNA 

Enzymatic nucleic acid molecule target sites are chosen by analyzing sequences of 
Human HIV (Genbank. accession No: NM_005228) and prioritizing the sites on the basis 

20 of folding. Enzymatic nucleic acid molecules are designed that can bind each target and 
are individually analyzed by computer folding (Christoffersen et al, 1994 J. Mol Struc. 
Theochem, 311, 273; Jaeger et al, 1989, Proa Natl. Acad. Sci. USA, 86, 7706) to assess 
whether the enzymatic nucleic acid molecule sequences fold into the appropriate 
secondary structure. Those enzymatic nucleic acid molecules with unfavorable 

25 intramolecular interactions between the binding arms and the catalytic core are eliminated 
from consideration. As noted below, varying binding arm lengths can be chosen to 
optimize activity. Generally, at least 5 bases on each arm are able to bind to, or otherwise 
interact with, the target RNA. 
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Example 3: Chemical Synthesis and Purification of Ribozvmes and Antisense for 
Efficient Cleavage and/or blocking of HIV RNA 

Enzymatic nucleic acid molecules and antisense constructs are designed to anneal 
to various sites in the RNA message. The binding arms of the enzymatic nucleic acid 
5 molecules are complementary to the target site sequences described above, while the 
antisense constructs are fully complementary to the target site sequences described above. 
The enzymatic nucleic acid molecules and antisense constructs were chemically 
synthesized. The method of synthesis used followed the procedure for normal RNA 
synthesis as described above and in Usman et aL, (1987 J. Am. Chem. Soc., 109, 7845), 
10 Scaringe et al, (1990 Nucleic Acids Res., 18, 5433) and Wincott et al t supra, and made 
use of common nucleic acid protecting and coupling groups, such as dimethoxytrityl at 
the 5'-end, and phosphoramidites at the 3'-end. The average stepwise coupling yields 
were typically >98%. 

Enzymatic nucleic acid molecules and antisense constructs are also synthesized 
from DNA templates using bacteriophage T7 RNA polymerase (Milligan and Uhlenbeck, 
1989, Methods EnzymoL 180, 51). Enzymatic nucleic acid molecules and antisense 
constructs are purified by gel electrophoresis using general methods or are purified by 
high pressure liquid chromatography (HPLC; See Wincott et aL, supra; the totality of 
which is hereby incorporated herein by reference) and are resuspended in water. The 
sequences of the chemically enzymatic nucleic acid molecules used in this study are 
shown below in Tables VI to IX. The sequences of the antisense constructs used in this 
study are shown in Table X. The sequences of the siRNA constructs used in this study 
are shown in Table XI. 

Example 4: Enzymatic nucleic acid molecule Cleavage of HIV RNA Target in vitro 

25 Enzymatic nucleic acid molecules targeted to the human HIV RNA are designed 

and synthesized as described above. These enzymatic nucleic acid molecules can be 
tested for cleavage activity in vitro, for example, using the following procedure. The 
target sequences and the nucleotide location within the HIV RNA are given in Tables III 
to IX. 
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Cleavage Reactions: Full-length or partially full-length, internally-labeled target 
RNA for enzymatic nucleic acid molecule cleavage assay is prepared by in vitro 
transcription in the presence of [a- 32 p] CTP, passed over a G 50 Sephadex column by 
spin chromatography and used as substrate RNA without further purification. 
Alternately, substrates are 5'- 32 P-end labeled using T4 polynucleotide kinase enzyme. 
Assays are performed by pre-warming a 2X concentration of purified enzymatic nucleic 
acid molecule in enzymatic nucleic acid molecule cleavage buffer (50 mM Tris-HCl, pH 
7.5 at 37°C, 10 mM MgCl2) and the cleavage reaction was initiated by adding the 2X 
enzymatic nucleic acid molecule mix to an equal volume of substrate RNA (maximum of 
1-5 nM) that was also pre-warmed in cleavage buffer. As an initial screen, assays are 
carried out for 1 hour at 37°C using a final concentration of either 40 nM or 1 mM 
enzymatic nucleic acid molecule, i.e., enzymatic nucleic acid molecule excess. The 
reaction is quenched by the addition of an equal volume of 95% formamide, 20 mM 
EDTA, 0.05% bromophenol blue and 0.05% xylene cyanol after which the sample is 
heated to 95°C for 2 minutes, quick chilled and loaded onto a denaturing polyacrylamide 
gel Substrate RNA and the specific RNA cleavage products generated by enzymatic 
nucleic acid molecule cleavage are visualized on an autoradiograph of the gel. The 
percentage of cleavage is determined by Phosphor Imager® quantitation of bands 
representing the intact substrate and the cleavage products. 

Indications 

Particular degenerative and disease states that can be associated with HIV 
expression modulation include but are not limited to acquired immunodeficiency disease 
(AIDS) and related diseases and conditions, including but not limited to Kaposi's 
sarcoma, lymphoma, cervical cancer, squamous cell carcinoma, cardiac myopathy, 
rheumatic diseases, and opportunistic infection, for example Pneumocystis carinii, 
Cytomegalovirus, Herpes simplex, Mycobacteria, Cryptococcus, Toxoplasma, 
Progressive multifocal leucoencepalopathy (Papovavirus), Mycobacteria, Aspergillus, 
Cryptococcus, Candida, Cryptosporidium, Isospora belli, Microsporidia and any other 
diseases or conditions that are related to or will respond to the levels of HIV in a cell or 
tissue, alone or in combination with other therapies 
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The present body of knowledge in HIV research indicates the need for methods to 
assay HIV activity and for compounds that can regulate HIV expression for research, 
diagnostic, and therapeutic use. 

The use of antiviral compounds, monoclonal antibodies, chemotherapy, radiation 

5 therapy, analgesics, and/or anti-inflammatory compounds, are all non-limiting examples 
of a methods that can be combined with or used in conjunction with the nucleic acid 
molecules (e.g. aptamers, siRNA, antisense, and enzymatic nucleic acid molecules) of the 
instant invention. Examples of antiviral compounds that can be used in conjunction with 
the nucleic acid molecules of the invention include but are not limited to AZT (also 

10 known as zidovudine or ZDV), ddC (zalcitabine), ddl (dideoxyinosine), d4T (stavudine), 
and 3TC (lamivudine) Ribavirin, delvaridine (Rescriptor), nevirapine (Viramune), 
efravirenz (Sustiva), ritonavir (Norvir), saquinivir (Ihvirase), indinavir (Crixivan), 
amprenivir (Agenerase), nelfinavir (Viracept), and/or lopinavir (Kaletra). Common 
chemotherapies that can be combined with nucleic acid molecules of the instant invention 

15 include various combinations of cytotoxic drugs to kill cancer cells. These drugs include 
but are not limited to paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, 
cyclophosphamide, doxorubin, fluorouracil carboplatin, edatrexate, gemcitabine, 
vinorelbine etc. Those skilled in the art will recognize that other drug compounds and 
therapies can be similarly be readily combined with the nucleic acid molecules of the 

20 instant invention are hence within the scope of the instant invention. 

Diagnostic uses 

The aptamers of the invention can be used to detect the presence or absence of the 
target substances to which they specifically bind, such as gp41 or gpl20. Such diagnostic 
tests are conducted by contacting a sample with the aptamer to obtain a complex that is 

25 then detected by conventional techniques known in the art. For example, the aptamers 
can be labeled using radioactive, fluroescent, or chomogenic labels. Interaction of labeled 
aptamer with the target can result in the detection of the target molecule via an ELISA 
type assay or sandwich assay, or by other means known in the art. Alternately, the 
aptamers of the invention can be used to separate or isolate molecules that specifically 

30 bind to the aptamer. For example, by coupling the aptamers to a solid support, target 
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molecules which bind to the aptamers can be recovered via affinity chromatography or 
analyzed by standard means known in the art. 

The enzymatic nucleic acid molecules of this invention can be used as diagnostic 
tools to examine genetic drift and mutations within diseased cells or to detect the presence 

5 of HIV RNA in a cell. The close relationship between enzymatic nucleic acid molecule 
activity and the structure of the target RNA allows the detection of mutations in any 
region of the molecule which alters the base-pairing and three-dimensional structure of 
the target RNA. By using multiple enzymatic nucleic acid molecules described in this 
invention, one can map nucleotide changes which are important to RNA structure and 

10 function in vitro, as well as in cells and tissues. Cleavage of target RNAs with enzymatic 
nucleic acid molecules can be used to inhibit gene expression and define the role 
(essentially) of specified gene products in the progression of disease. In this manner, 
other genetic targets can be defined as important mediators of the disease. These 
experiments can lead to better treatment of the disease progression by affording the 

15 possibility of combinational therapies (e.g., multiple enzymatic nucleic acid molecules 
targeted to different genes, enzymatic nucleic acid molecules coupled with known small 
molecule inhibitors, or intermittent treatment with combinations of enzymatic nucleic 
acid molecules and/or other chemical or biological molecules). Other in vitro uses of 
enzymatic nucleic acid molecules of this invention are well known in the art, and include 

20 detection of the presence of mRNAs associated with HIV-related condition. Such RNA is 
detected by determining the presence of a cleavage product after treatment with an 
enzymatic nucleic acid molecule using standard methodology. 

hi a specific example, enzymatic nucleic acid molecules which cleave only wild- 
type or mutant forms of the target RNA are used for the assay. The first enzymatic 
25 nucleic acid molecule is used to identify wild-type RNA present in the sample and the 
second enzymatic nucleic acid molecule is used to identify mutant RNA in the sample. 
As reaction controls, synthetic substrates of both wild-type and mutant RNA are cleaved 
by both enzymatic nucleic acid molecules to demonstrate the relative enzymatic nucleic 
acid molecule efficiencies in the reactions and the absence of cleavage of the "non- 
30 targeted" RNA species. The cleavage products from the synthetic substrates also serve to 
generate size markers for the analysis of wild-type and mutant RNAs in the sample 
population. Thus each analysis requires two enzymatic nucleic acid molecules, two 
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substrates and one unknown sample which is combined into six reactions. The presence 
of cleavage products is determined using an RNAse protection assay so that full-length 
and cleavage fragments of each RNA can be analyzed in one lane of a polyacrylamide 
gel. It is not absolutely required to quantify the results to gain insight into the expression 
of mutant RNAs and putative risk of the desired phenotypic changes in target cells. The 
expression of mRNA whose protein product is implicated in the development of the 
phenotype (i.e., HIV) is adequate to establish risk. If probes of comparable specific 
activity are used for both transcripts, then a qualitative comparison of RNA levels will be 
adequate and will decrease the cost of the initial diagnosis. Higher mutant form to wild- 
type ratios are correlated with higher risk whether RNA levels are compared qualitatively 
or quantitatively. The use of enzymatic nucleic acid molecules in diagnostic applications 
contemplated by the instant invention is more fully described in George et al, US Patent 
Nos. 5,834,186 and 5,741,679, Shih et aL, US Patent No. 5,589,332, Nathan et al, US 
Patent No 5,871,914, Nathan and Ellington, International PCT publication No. WO 
00/24931, Breaker et al, International PCT Publication Nos. WO 00/26226 and 
98/27104, and Sullenger et al, International PCT publication No. WO 99/29842. 

All patents and publications mentioned in the specification are indicative of the 
levels of skill of those skilled in the art to which the invention pertains. All references 
cited in this disclosure are incorporated by reference to the same extent as if each 
reference had been incorporated by reference in its entirety individually. 

One skilled in the art would readily appreciate that the present invention is well 
adapted to carry out the objects and obtain the ends and advantages mentioned, as well as 
those inherent therein. The methods and compositions described herein as presently 
representative of preferred embodiments are exemplary and are not intended as 
limitations on the scope of the invention. Changes therein and other uses will occur to 
those skilled in the art, which are encompassed within the spirit of the invention, are 
defined by the scope of the claims. 

It will be readily apparent to one skilled in the art that varying substitutions and 
modifications may be made to the invention disclosed herein without departing from the 
scope and spirit of the invention. Thus, such additional embodiments are within the scope 
of the present invention and the following claims. 
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The terms and expressions which have been employed are used as terms of 
description and not of limitation, and there is no intention that in the use of such terms 
and expressions of excluding any equivalents of the features shown and described or 
portions thereof, but it is recognized that various modifications are possible within the 
scope of the invention claimed. Thus, it should be understood that although the present 
invention has been specifically disclosed by preferred embodiments, optional features, 
modification and variation of the concepts herein disclosed may be resorted to by those 
skilled in the art, and that such modifications and variations are considered to be within 
the scope of this invention as defined by the description and the appended claims. 

In addition, where features or aspects of the invention are described in terms of 
Markush groups or other grouping of alternatives, those skilled in the art will recognize 
that the invention is also thereby described in terms of any individual member or 
subgroup of members of the Markush group or other group. 
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Table I: HIV env sequences (Subtype. Country, isolate year, isolate name) 



HIV env sequences 

A.BY.97.97BL006 
A.GB.-.MA246 
A.GB.-.MC108 
A.KE.90.K89 
A.RW.-.PVPI 

A. RW.-.SF1703 
A.SE.94.SE7535 
A.SE.95.SE8538 
A.SE.95.SE8891 

A.SE.95.UGSE8131 
A.UA.97.ukr970063 
A.UG.90.UG273A 
A.UG.90.UG275A 
A1.KE.93.Q23-17 
A1.SE.94.SE7253 
A1.UG.85.U455 
A1.UG.92.92UG037 
A2.CP.-.97CDKS10 
A2.CP.-.97CDKTB48 
A2.CY.94.94CY017.41 
A2C.ZM.89.ZAM174 
A2C.ZM.89.ZAM716-3 
A2C.ZM.90.ZAM18B 
A2D.KR.97.97KR004 
A2G.CD.-.97CDKP58 

AC.BE.-.VI313 
AC.1N.95.95IN21301 

AC. RW.92.92RW009 

AC. SE.96.SE9488 
ACD.SE.95.SE8603 

ACG.BE.-.VI1035 

AD. KE.90.K124A2 
AD.SE.93.SE6954 
AD.SE.95.SE7108 
AD.UG.-.C6080-10 

AD. UG.92.2UG035-22 
ADHK.NQ.97.97NOGIL3 

ADK.CD.85.MAL 
AG.BE.-.VI1197 
AG.BE.-.VI5251 
AG.CD.89.VI191A2 
AG.NG.92.92NG003 
AGHU.GA.-.V1354 
AGJ.BW.98.BW2117 
AGU.CD.76.Z321 
AU.NG.94.NG3678 
AU.NG.95.NG1935 
AU.SE.93.SE6594 
B.ALUVH 

B. AU.86.MBC200 



82 



WO 03/102131 



PCT7US03/12626 



B.AU.87.MBC925 
B.AU.93.MBC18 
B.AU.95.MBCC54 
B.AU.96.MBCC98 
B.AU.96.MBCD36 
B.BE.-.SIM184 
B.CA.-.CA5 
B.CN.-.RL42 
B.DE.86.D31 
B.DE.86.HAN 
B.ES.89.89SPQ61 
B.FR.-.PH1120 
B.FR.-.PH1133 
B.FR.-.PH1146 
B.FR.-PHI153 
B.FR.-.PH1159 
B.FR.-.PIH155 
B.FR.-.P1H160 
B.FR.-.P1H309 
B.FR.-.PIH373 
B.FR.-.PIH374 
B.FR.83.HXB2 
B.GA.-.OYl, 
B.GB.-AC-46 
B.GB.-.CAM1 
B.GB.-.GB8.C1 
B.GB.-.HIM286332 
B.GB.-.H1M286336 
B.GB.-.H1M286337 
B.GB.-.JB 
B.GB.-.M23470 
B.GB.-.M24244C3 
B.GB.-.M26864 
B.GB.-.M30156 
B.GB.-.M737677 
B.GB.-.M737685 

B.GB.-.MB314 
B.GB.-.PE052C1 
B.GB.-.PE104C38 
B.GB.-.PE131C3 

B.GB.-.WB 
B.GB.59.MANC 

B.JP.-.ETR 
B.JP.86.JH32 
B.KR.-.WK 
B.NL-.68A 
B.NL.-.ENWA 
B.NL-.ENWF 
B.NL.-.ENWG 
B.NL.86.3202A21 
B.NL.86.H0320-2A12 
BTH.93.93TH067 
B.TT.-.QZ4589 
B.TW.-.TWCYS 
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B.UA.-.UKR1216 
B.UNK.-.NL43E9 
B.US.-.546BMB4 
B.US.-.ADA 
B.US.-.BORl 
B.US.-.BRVA 
B.US.-.C26-12.1BH 

B.US.-.DH123 
B.US.-.M02-3.SW 
B.US.-.NC7 
B.US.-.P896 
B.US.-.SF128A 
B.US.-.US1 
B.US.-.US2 
B.US.-.US3 
B.US.-.US4 
B.US.-.WMJ22 
B.US.83.RF 
B.US.83.SF2 
B.US.84.CDC452 
B.US.84.MNCG 
B.US.84.NY5CG 

B.US.84.SC 
B.US.84.SC141 
B.US.84.SC14C 
B.US.85.85WC1PR54 
B.US.85.ALA1 
B.US.85.SFMHS11 
B.US.85.SFMHS21 
B.US.85.SFMHS3 
B.US.86.JRCSF 
B.US.86.JRFL 
B.US.86.SFMHS1 
B.US.86.SFMHS1 6 
B.US.86.SFMHS17 
B.US.86.SFMHS18 
B.US.86.SFMHS2 
B.US.86.SFMHS4 
B.US.86.SFMHS8 
B.US.86.YU2 
B.US.87.BC 
B.US.87.SFMHS5 
B.US.87.SFMHS7 
B.US.87.SFMHS9 
B.US.88.SFMHS19 
B.US.88.SFMHS6 
B.US.88.WR27 
B.US.89.R2 
B.US.89.SFMHS20 

B. US.90.WEAU160 
B.US.92.92US657.1 

BC.CN.-.CHN19 
BF1.BR.93.93BR029.4 

C. BI.91.BU910112 
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C.BI.91.BU910213 
C.B1.91.BU910316 
C.BI.91.BU910423 
C.BL91.BU910518 
C.BI.91.BU910611 
C.BL91.BU91Q717 
C.BI.91.BU910812 
C.BR.92.92BR025 
C.BW.96.96BW01B03 
C.BW.96.96BW0402 
C.BW.96.96BW0502 
C.BW.96.96BW11B01 
C.BW.96.96BW1210 
C.BW.96.96BW15B03 
C.BW.96.96BW16B01 
C.BW.96.96BW17B05 
C.DJ.91.DJ259A 
C.DJ.91.DJ373A 
C.ET.86.ETH2220 
C.IN.-.H1M276221 
C.IN.93.931N101 
C.1N.93.931N904 
C.1N.93.93IN905 
C.IN.93.931N999 
C.1N.94.941N11246 
C.IN.95.951N21068 
C.SN.90.SE364A 
C.SQ.89.SQ145A 
C.UG.9Q.UG268A2 
CD.BI.91.BU910905 
CPZ.CD.-.CPZANT 
CPZ.CM.-.CAM3 
CPZ.CM.98.CAM5 
CPZ.GA.-.CPZGAB 
CPZ.US.85.CPZUS 
CRF01 AE.CF.90.90CF1 
CRF01 AE.CF.90.90CF4 
CRF01_AE.CM.-.CA1Q 
CRF01 AE.TH.90.CM240 
CRF01_AE.TH.92.TH920 
CRF01 AE.TH.92.TH921 
CRFQ1 AE.TH.93.93THQ 
CRF01 AE.TH.93.93TH2 
CRF01 AE.TH.93.KH03 
CRF01 AE.TH.93.KH08 
CRF01_AE.TH.94.A0102 
CRF01 AETH.94.E1142 
CRFQ1 AE.TH.95.95TNI 
CRF01 AE.TH.95.N1114 
CRF01 AE.TH.95.N1115 
CRF01_AE.TH.95.TH022 
CRF01 AE.TH.96.N1115 
CRF02 AG.CM.97.MP807 
CRF02_AG.DJ.91.DJ258 
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CRF02 AG.FR.91.DJ263 
CRF02 AG.FR.91.DJ264 
CRF02 AG.GH.-.G829 
CRF02 AG.NG.-.IBNG 
CRF02 AG.NG.94.NG367 
CRF02 AG.NG.95.NG192 
CRF02 AG.SE.94.SE781 
CRF02 AG.SN.98.MP121 
CRF03 AB.RU.-.KAL68. 
CRF03 AB.RU.97.KAL15 
CRF03 AB.RU.98.RU980 
CRF04 cpx.CY.94.94CY 
CRF04 cpx.GR.91.97PV 
CRF04 cpx.GR.97.97PV 
CRF05_DF.BE.-.V11310 
CRF05 DF.BE.93.V1961 
CRF06 cpx.AU.96.BFP9 
CRF06 cpx.ML.95.95ML 
CRF06 cpx.NG.94.NG36 
CRF06 cpx.SN.97.97SE 
CRF10 CD.TZ.96.96TZB 

CRF11 cpx.CM.-.CA1 
CRF11 cpx.CM.-.MP818 
CRF11 cpx.FR.-.MP129 
CRF11 cpx.FR.-.MP130 
CRF11 cpx.GR.-.GR17 
CRF11__cpx.NG.94.NG36 

D.CD.-.JY1 

D.CD.83.ELI 
D.CD.83.NDK 
D.CD.84.84ZR085 
D.CD.85.Z2Z6 , 

D.CI.-.C113 

D.SN.90.SE365A2 
D.TZ.87.87TZ4622 
D.UG.-.C971-412 
D.UG.-.WH01 5-474 
D.UG.90.UG266A2 
D.UG.90.UG269A 
D.UG.9Q.UG274A2 
D.UG.92.92UG024-D 
D.UG.94.94UG1141 
F1.BE.93.Vt850 
F1.BR.89.BZ126 
F1.BR.93.93BR02Q.1 
F1.FI.93.FIN9363 
F1.FR.96.MP411 
F2.CM.-.CA2Q 
F2.CM.-.HIM277819 
F2.CM.95.MP255 
F2.CM.95.MP257 
F2KU.BE.94.VI1126 

G.BE.96.DRCBL 
G.FL93.HH8793-12.1 
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G.GA.-.LBV217 
G.NG.92.92NG083 
G.NG.95.NG1928 
G.NG.95.NG1929 
G.NG.95.NG1937 
G.NG.95.NG1939 

G. SE.93.SE6165 
GH.GA.90.VI525 

H.BE.93.V1991 
H.BE.93.VI997 

H. CF.90.90CF056 
J.SE.93.SE7887 
J.SE.94.SE7022 

K.CD.97.EQTB11C 
K.CM.96.MP535 
MO.CM.97.97CAMP645MO 
N.CM.-.YBF106 
N.CM.95.YBF30 
O.CM.-.ANT70 
O.CM.-.CM4974 
Q.CM.91.MVP5180 
O.CM.93.HIV1CA9EN 
O.GA.92.VI686 
O.GQ.-.193HA 
O.GQ.-.276HA 
O.GQ.-.341HA 
O.SN.99.SEMP1299 
O.SN.99.SEMP1300 
U.CD.83.83CD003 
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Table H: 



A. Z5 umol Synthesis Cycle ABI 394 Instrument 



Reagent 


Equivalents 


Amount 


Wait Time* DNA 


Wait Time* 2'-0-methyl 


WaitTime*RNA 


Phosphoramidites 


6.5 


163 pL 


45 sec 


2.5 min 


7.5 min 


S-Ethyl Tetrazole 


23.8 


238 pL 


45 sec 


2.5 min 


7.5 min 


Acetic Anhydride 


100 


233 pL 


5 sec 


5 sec 


5 sec 


A/-Methyl 
Imidazole 


186 


233 pL 


5 sec 


5 sec 


5 sec 


TCA 


176 


2.3 mL 


21 sec 


21 sec 


21 sec 


Iodine 


11.2 


1.7 mL 


45 sec 


45 sec 


45 sec 


Beaucage 


12.9 


645 pL 


100 sec 


300 sec 


300 sec 


Acetonitrile 


NA 


6.67 mL 


NA 


NA 


NA 



B. 0.2 umol Synthesis Cycle ABI 394 Instrument 



Reagent 


Equivalents 


Amount 


Wait Time* DNA 


Wait Time* 2'-0-methyl 


Wait Time*RNA 


Phosphoramidites 


15 


31 pL 


45 sec 


233 sec 


465 sec 


S-Ethyl Tetrazole 


38.7 


31 pL 


45 sec 


233 min 


465 sec 


Acetic Anhydride 


655 


124 pL 


5 sec 


5 sec 


5 sec 


N-Methyl 
Imidazole 


1245 


124 pL 


5 sec 


5 sec 


5 sec 


TCA 


700 


732 pL 


10 sec 


10 sec 


10 sec 


Iodine 


20.6 


244 pL 


15 sec 


15 sec 


15 sec 


Beaucage 


7.7 


232 pL 


100 sec 


300 sec 


300 sec 


Acetonitrile 


NA 


2.64 mL 


NA 


NA 


NA 
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C. 0.2 pmol Synthesis Cycle 96 well Instrument 



Reagent 


mm » _ 1 A rMI A t 

Equivalents:DNA/ 
2'-0-methyl/Ribo 


Amount: dna// -u- 
methyl/Ribo 


\AJVit Timp* DNA 

Weill 1 Milt? U l» AA 


Wait Time* 2'-0- 
methyl 


Wait Time* Ribo 


Phosphoramidites 


22/oo/OD 


An/ftnyi9n ni 


60 sec 


180 sec 


360sec 


S-Ethyl Tetrazole 


70/105/210 


40/60/120 pL 


60 sec 


180 min 


360 sec 


Acetic Anhydride 


265/265/265 


50/50/50 pL 


10 sec 


10 sec 


10 sec 


W-Methyl 
Imidazole 


502/502/502 


50/50/50 pL 


10 sec 


10 sec 


10 sec 


TCA 


238/475/475 


250/500/500 pL 


15 sec 


15 sec 


15 sec 


Iodine 


6.8/6.8/6.8 


80/80/80 pL 


30 sec 


30 sec 


30 sec 


Beaucage 


34/51/51 


80/120/120 


100 sec 


200 sec 


200 sec 


Acetonitrile 


NA 


1150/1150/1150 pL 


NA 


NA 


NA 



• Wait time does not include contact time during delivery. 
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Table III: HTV env target sequences 



Sequence 


Seq ID 


AAAAAUAACAUGGUA 


1 


AAAAAUAUUCAUAAU 


2 


AAAAGAAUGAACAAG 


3 


AAAAUAACAUGGUAG 


4 


AAACUGCUCUUUCAA 


5 


AAAGAAUGAACAAGA 


6 


AAAGAGAAGAGUGGU 


7 


AAAGCCAUGUGUAAA 


8 


AAAGCCUAAAGCCAU 


9 


AAAUAACAUGGUAGA 


10 


AAAUAUAAAGUAGUA 


11 


AAC AUGAC CUGGAUG 


12 


AACAUGUGGAAAAAU 


13 


AACGCUGACGGUACA 


14- 


AACUCACAGUCUGGG 


15 


AAGAAGAAGGUGGAG 


16 


AAGAGAAGAGUGGUG 


17 


AAGCAAUGUAUGCCC 


18 


AAGCACUAUGGGCGC 


19 


AAGCCUAAAGCCAUG 


20 


AAGUGAAUUAUAUAA 


21 


AAUAACG CUG ACGGU 


22 


AAUAGAGUUAGGCAG 


23 


AAUAUUCAUAAUGAU 


24 


AAUCAGUUUAUGGGA 


25 


AAUGAUAGUAGGAGG 


26 


AAUGGCAGUCUAGCA 


27 ! 


AAUGUACACAUGGAA 


28 


AAUGUAUGCCCCUCC 


29 


AAUGUCAGCACAGUA 


30 


AAUUAUAUAAAUAUA 


31 


AAUUCCCAUACAUUA 


32 


AAUUGGAGAAGUGAA 


33 


AAUUUGCUGAGGGCU 


34 


ACAAUGUACACAUGG 


35 


ACAAUUAUUGUCUGG 




ACAAUUGGAGAAGUG 


37 


ACACAUGCCUGUGUA 


38 


ACAGACCCCAACCCA 


39 


ACAGGCCAGACAAUU 


40 


ACAGUACAAUGUACA 


41 , 


ACAGUCUAUUAUGGG 


42 


ACAUGCCUGUGUACC 


43 


ACAUGUGGAAAAAUA 


44 1 


ACCACCGCUUGAGAG 


45 


ACCCACAGACCCCAA 


46 


ACCC CAACC CACAAG 


47 


ACCUGGAGGAGGAGA 


48 


ACCUGUGUGGAAAGA 


49 


ACGCUGACGGUACAG 


50 


ACGGUAC AGG C C AGA 


51 
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a pt in apa pttpt tpppp 


52 


TV P TV Tt TV 71 TV TV P 7\ P P A PT ^ 

AGAAAAAAGAG G AvjU 


53 


apa apaa ppttpp.tap.a 


54 


7\ PT tv-i 7v RTTTTATTaTT 


55 


A P A P 7\ 7V T TTt A T TT TPT TPT T 

AbALAAU UAU U Vjj U C U 


56 


7\ P A fCCC A a PPP 71 P a 


57 


aGAPPTTGGAGGAGGA 


58 


anapa A A AAAGAGPA 


59 


aGaGAGAAAA AAGAG 


60 


ap a gt tt t acnr aggg a 


61 


APaiTGPTTA A AGPAITA 


62 


appa attgttattgpppp 


63 


ahpaptt^t tgggpg p a 


64 


A GP AGP Attn A AG C AC 


65 


a GP A RO A AGP APT TAU 


66 


aGPPT T A A AGPP A1IGU 


67 


aPPPT TGT TGPPT TPT TT TP 
l_ U vj U vj v_ t_ U V— U U v~ 


68 


jxppi tpp appp a an AG 


69 


appa APPaPTTaTTGGG 
AGIjAA\jL,/\v-U*mJvjvjvjx 


70 


APPAPPAPA T T A TTP_7\ CZ 


71 


A PPAPTTAPPRPPPBI* 1 

AGG Ab UAu UALv^LAL 


72 


APP7VTTPA A PAP PT TP P 

AGGAU LAALAb^UtL 


73 


a pppa apapi TppnnPi 
AGGG AAGAG ULtU vjrfj 


74 


APPPAPPPATTB PTTPA 
AbuLAbuuAUALULA 


75 


APPPAPA ATTTTPP APa 

AGGGAGAAU UoijAia/i 


76 


A PTT7V PA ATTPIT&PflPfl 

AG U AC AAU U AL AL i\ 


77 


Ar'ITPTTaTTTTaTTP.GGGTT 


78 


A PTTP AATTTTATTATTAAa 

AbUuAAU UAUAUAAA 


79 


TtPITTTAPPPaPPPZilTli 


80 




81 


APTTTTTTATTPPpaTTPaa 


82 


aTTaaaaaTTATTTTPATTA 


83 


aTTAAATTATTAAAGTTAG 


84 


aTia APGPTTGAPGGI TA 


85 


aTTaATTPAGTTTTTTATTGG 


86 


aTTaaTTHATTAGTlAGGA 


87 


AT TAP APTTf TanP.PaP.P2 
AUAuAuU UiAvjjOv^/Aoo 


88 


ATTAPTTPPA aPaPPA A 


89 


attatta a aTTaTTaaaGTt 


90 


BTTaTTaaTTPaGTTTTTTATT 


91 


ATiaT TP aGGG AP A aTTTT 


92 


BTTaiTITPaTlA ATTGATTA 


93 


aTTPaa ATTATTTTAPAGG 


94 


TV TTP A APAPPT7PPTTAP 


95 


aitpap attpptta a app 

AULAuAUbLUAAAvjL 


96 


TV ¥ T/*~i "T\ /"IT TT TT T "ft T TPPP 7V T t 

AUCAGUUUAUGGGAU 


Q*7 


TV TTP APPPAPA ATTTTPP 

AUGAGGGAGAAUUGG 


98 


AUGAUAGUAGGAGGC 


99 


AUGCCUGUGUACCCA 


100 


AUGGC AGUCUAG CAG 


101 


AUGGGGUACCUGUGU 


102 


AUGUACACAUGGAAU 


103 


! AUGUAUGCCCCUCCC 


104 


AUGUCAG C ACAGUAC 


105 
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a t tpt tp p a a a a an a a p 


106 


atttta apa Tip anai tpp 


107 


ATTTTATTATTA AATTZ1TTAA 

AU UAU AUAAAUAU/\tt. 


108 


A T TT T A T tnfP PT T A P PT IT* 

AUUAUbbbbUAbbUbi 


109 


ATTTTP AT7A ATTPATTAPTT 

AU UbAUAAUbAUAbU 


110 


ATTTTPPP ATTAP ATTTTAT T 


111 


ATTTTPP A PA APT TP A ATT 


112 


ATTTTPTTPTTPPTTATTAriTT 


113 


ATTTTTTPPT TPftPPPPTTA 


114 


ATTTTTTnA APATTPTTPPA 


115 


ATTTTTTTTPTTPPATTPAPA 
AU U U UbUbbrtUb-Avj/T. 


116 


b AAAbAbAAb AbU bb 


117 


/-1 7\ tv APPPTTA A APPPA 

b AAAb b b U AAAb b b A 


118 


o« 7v OTTPA P A PT TPT TPP 

C AAb U b Ab Ab U b U bb 


119 


/-tfk. 7\ tt a a pipphtp A PPP 
C AAUAAbbb U bAbb b 


120 


nil ATTPTTA P A PATTPP A 

I CAAUbUAbAbAUbbA 


121 


prv ATTPTTATTPPPPPTTP 


122 


PA ATTI TPPPATTAPATTTT 


123 


P A ATTTTPP AP A APTTPA 


124 


PA ATTTTTTPPT TP APPPP 


125 


P A P A P A PPPP A A PPP 


126 


o» A/~rr TPfT ATTTTAT TPP 

CAGAGUbUAUUAUbb 


127 


p a p a pt tpt iropp P AT T 
b AC AbU b U bbbb b A.U 


128 


pa niMTPPPTTPITPTTAP 

bAbAUbbbUbUbUAb . 


129 


P7\PIT7kTTPPPPPPaPP 

C AC U AU bb b bbbAb b 


13 0 


/~i A /""» A AT TTTATTTTPT TP* 

CAbAb AAU U AU Ub U b 


131 


PAPAPPrra apppap 
LAbAbbLLAALbbAb 


132 


i P AP A PPTTPP APP APP 

CAbAbb UbbAbbAbb 


133 


P7\PATTPPITR A APP ATT 

b AbA U bb U AAAb b AU 


134 


pi\pni\ PTiPTIAPS AT7P 
b Ab b Ab Ab U Ab Art-Ub 


135 


nAPHAPPJl APPAPTTA 

CAbb AbbAAbb Ab UA 


136 


f~\ /-?/-1¥ T*^1/*1 7V P»/~*/^ A A P A 

CAGCUCCAGGCAAGA 


137 


CAGGAAGCACUAUbb 


138 


/-» A P»P«P' A APA PTTPPTTP 

CAGGbAAbAbU bb Ub 


139 


/-»■* y-"i/-»/"V' AP A PA A T TT TA 

CAbGbbAbAbAAU U A 


140 


r"> A PT TA OA ATTPTTA PAP 1 

CAGUACAAUGUACAb 


141 


y-i 7\ PTTPTTTV TTTTATTPPPP 

CAGUbUAUUAUbbbb 


142 


PA PUTTTTT ATTPPP ATTP A 

bAbUUUAUbbbAUbA 


143 


PATTA ATTPAT 1 AP1TAPP 
bA U AAU b/iU/i\3 U \3 


144 


PATTAPATTTTATTTTPT7PP 


145 


PAT TP A P ATTPnTA A AP 
bAUbAbAUbbUAHAb 


146 


PATTPPPT TPTTPTTB PPP 

bAUbbbUbUbUAbbb 


147 


PAT TPTTPP A A A A ATT A A 
bAUb UbbAAAAAU AA 


148 


7\ ATTTTPPP ATT A PATT 

CCAAUUbbbAUAbAU 


14 9 


pp a pa p a pppp A A pp 
C CAb Ab Abb bb AAb b 


150 


C C AC CGCUUG AG AGA 


1 til 


CCAGACAAUUAUUGU 




CCAGGCAAGAGUCCU 


153 


CCAUACAUUAUUGUG 


154 


CC CAACCCAC AAGAA 


155 


CCCACAGACCCCAAC 


156 


CCCAUACAUUATJUGU 


157 


CCCCaACCCACAAGA 


158 


CCGCUUGAGAGACUU 


159 



92 



WO 03/102131 



PCT/US03/12626 



rim t tv Tv 7\ pnp7\T IPT 7P 1 ! T 

CCUAAAGCLAUGUGU 


160 


CCTlCUuCAGLUALLA 


161 


r~>r~*j t/T"* TvP'P'AP'P'AP'ATT 

L C UGGAGGAGG AGAU 


162 


«-»/— tTf/-i/»-«/"iT TP<T TP 1 P 1 A A TiP 

CCUGGCUGUGGAAALr 


163 


nmi/^T T/T^P'T TP*T TT TP* A P 1 

C CUGUGCL U L U U LAG 


164 


/"•P'TTPMTP'TTA PrnS PAP 

CCUGUGUALLLALALi 


165 


CCUUGGGUUCUUGGG 


-LOO 


CCUUUGAGCCAAUUC 




CGCUGACGGUACAGG 


J.OO 


CGGUACAGGCCAGAC 


ICQ 


CUAAAGCCAUGUGUA 


J. /U 


CUAAAGGAUCAACAG 


JL / X 


CUAGUUGGAGUAAUA 


1 "70 
J. / Z 


CUAUUAUGGGGUACC 


J. / j 


Cuauuuuguglauca 


174 


/-»TT/"1 TV r^7\ HTTP! TPPPr'P 

cuciacagucuggggl 


175 


C U LL AGGLAAG AG U L 


176 


mTPITPPA A A AP^TTfATT 

Cucuggaaaagul.au 


177 


CUCUUCAGC UAL LAG 


178 


CUGACGGUACAGG C C 


1 7 Q 


CUGGAGGAGGAGAUA 


180 


CUGGCUGUGGAAAGA 


X O X 


C U GGUAUAGUG C AAC 


182 


CUGUGCCUCUUCAGC 


loo 


CUGUGGAAAGAUACC 


1 R4 

J. 0*i 


CUGUGGUAUAUAAAA 


1 ft*; 
lOJ 


CUGUGUAC C C ACAG A 




CUUCAGACCUGGAGG 


lo / 


CUUCAGCUACCACCG 


TOO 

loo 


CUUGGGAG C AG CAGG 


189 


CUUGGGUUCUUGGGA 


190 


CUUUGAGCCAAUUCC 


191 


GAAAAAUAACAUGGU 




GAAAAGAAUGAACAA 




GAAGAAGAAGGUGGA 




GAAGAAGGUGGAGAG 




/TTV TV r*l K~« TV /~^T T TV T T/"' P» P PV< 

GAAGCACUAUGGGCG 




/~% 7* TV OTTr»7l TV T TT T "IV TTTATT7V 

GAAGUGAAUUAUAUA 


1 Q7 


P" A ATTI7ATTATTA A ATT ATT 

GAAU UAU AU AAAUAU 


198 


G AG AAU UAU UG U L U G 


1 QQ 


TV /"I TV TV TTT T/~«/^«H P»TV TV PIT 

GACAAUUGGAGAAGU 


zuu 


/TTV /in/1/1T\ TV /l*^/""* TV /I TV TV 

GACCCGAACCCACAA 




tv /i/TiT/i/TTi TV yn/r TV /""* 

GACCUGG AGGAG GAG 


7 fi7 


GACGGUACAGGCCAG 


203 


gagaaaaaagagcag 




GAGAAGUGAAUUAUA 


205 


GAGAGAAAAAAGAGC 


; 206 


GAGL AGL AGGAAG LA 


207 


GAGCCUGUGCCUCUU 


208 


GAGGAGGAGAUAUGA 


209 


GAGGGACAAUUGGAG 


210 


GAGUUAGGCAGGGAU 


211 


GAUAGUAGGAGGCUU 


212 


GAUAUAAUCAGUUUA 


213 
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p a t tp abpb ppt tpptt a 
GAUGAAGAoCUuuua 


214 


P ATTPPTTA A APPATTATT 


215 


p p a a pt tpa p a pt tpt tp 

Ljv_ AAL. U A^ MajjU^Uvjj 


216 


ppi\ 7\ r 1 APTTPPTTPPPI7 


217 


Pr»A ATTPTTATTPPPPPTT 


218 


PP A PTTATTPPPPPPAP 

GGAG UAUGGGL.VjL.A1j 


219 


PP 7v APP A APPAPTT 

GCAGGAGGAAGGAGU 


220 


PP7\ PP A A PP A /""IT TAT TP* 

GCAGGAAGGAG U AUG 


221 


GGAGGGAUACUCACL 




^ /*1 t TV TV T T^/"Vf T7V TV 7V 

G GAUCAGAUGC UAAA 


223 


z~i rip 7V P 7\ P 7\ 7\ T TT T A T TT TP 1 

GCGAGAGAAU UAU Ub 


224 


r»fr«m\ A APPPATTPTTP 
GCCUAAAGGGAULjULj 


225 


GCGULUUGAGLUALL 


226 


PPPT TPT TP PPT TPT TTTP A 

GCG U GU GGG U L-U U UV 


227 


PPPT TP T TPT t a PPP a P & 
Lr L U Lr U Lr UAL. l«^f\^.ri 


228 


ppttppapppaapaptt 

Lt L U L L. ALj vj k_~AAV3.rVJ U 


229 


p/ttpttpp a a a aPTTPA 

LiL U L U VjLtAAAAL U LA 


230 


ppttp apppttapappp 

LtLULtAL.LiL»UA\-,Av30V«. 


231 


PPTTPTTPPA A APATTAP 


232 


PPTTPT 7PPTTATTATTA A A 

GGUGUGGUAUAUAAA 


233 


pp t\ a A A ATT A APA1TPP 

GGAAAAAUAALAULtLt 


234 


PP A 7\ nps PTT ATTPPPP 
LrGAALrGALUAULrLTLjL 


235 


PP A PA ATTTTPPA PA AP 


236 


PPA PA APTTPA ATTTTZ1TT 
GGAGAALtULtAAU UAU 


237 


/"i/""i7\/T~»7\P'/"" , 7\f 1 P , A A PP 

GG AG CAG L AGGAAGG 


23 8 


/-■/-I -jv nrtrtt TPT TPPPTTPTT 

GGAGCCUGUGGGULU 


239 


^riT\ Str* TV TV TV TTT\ TT/"* 

ggaggaggagauaug 


24 0 


ggaucaacagcuccu 


Oil 

z*± ± 


ggcaagaguc cuggc 


OA"? 
ZffcZ 


gg c aggg auacucac 


9 4 ~* 


ggcagucuaggagaa 


244 


/""»/"-■ /-tt Tp t TPP A 7\ 7\ P7\T T A 

GGCUGUGGAAAIjAUA 


245 


pp PI TPT TPPT T7\ TT7V TT A A 

GGCUGUGGUAUAUAA 


246 


PPP TV PTV 7V T TT VP A P A A 

GGGACAAU UGGAGAA 


247 


PPPAPPAPPA PP A AP 

GGG AG GAGGAGL* AALr 


248 


PPPP TV ft PPP A PA PPP 

GGGGAL G GGAGAGGL 


249 


PPPPTT7V PPTTPTTPTTPP 

GGGGUACCUGUGUGG 


250 


PPPTTAPPTTPTTPITPP A 

GGGU ALL. U Li ULt U IjLtA 


251 


PPPTTPAPAPTTPTTATTTT 

GGGU GAGAGUL. UAU U 


252 


PPPT TT Tr*T TT TPPP A P P A 

GGGUUGUUGGGALrLA 


253 


PPTTA PAPPPPAP AP A 

GG U AG AGG G L ALtALA 


254 


PPT TA P PT TPT TPT JC* A A 

GGUAGGUGUGUGLiAA 


255 


PPT T TV TT7V PT TP PA A P A P 

GGUAU AGU GGAALALr 


256 


/^1/™ITTTVTT7VTTTV TV 7V 7\ 7VT T7V T T 

G Gu au auaaaaau au 


2 57 


ggucacagucuauua 


258 


G GUUCUUGGGAGCAG 


Zbi* 


guacaauguacacau 


Z D U 


PTTTV PA PAT TPP A ATTTTA 

j GUAGAGAUGGAAUUA 


261 


GUACAGGCCAGACAA 


262 


GUACCCACAGACCCC 


263 


GUACCUGUGUGGAAA 


! 264 


G UAUAG UG C AACAGC 


265 


GUAUAUAAAAAUAUU 


266 1 


GUAUGCCCCUCCCAU 


267 
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pttp a elt ta apppttp ap 


268 


PT TP A P 21 PT TPT 7 AT TT T AT 7 


269 


PTTPA P P APAPT TA P A A 


270 


r*T TP*T TA T TT T AT TPPPPT T A 


271 


r>l TP«T TPPTT ATT A PTTPP A 


272 


/-<t t/-i ATTTTATTATTA A ATT 

GUGAAU UAUAU AAAU 


273 


/**7 Tf A T If A A T TP* PT T A 

GUGC AU C AGAU GC U A 


974. ! 


GUGCCUCUUuAt3L.UA 


275 


/~tTT/™»<^» A AAAATTAAP*ATT 

GUGGAAAAAUAACAU 


A / D 


PTTPPIV A AP 1 ATT A PPUTA 

GUGGAAAGAUAL.L U A 


277 


GUGGAACUUCUGGGA 


278 


PT TPP PT TP A P & PT TPT TA 


279 


mTPVTTATTATTA a a a att 

Vj ULiLjUrtUrtUrtrt-rtrtrtU 


280 


PT TPT T A PPP A P AP A PP 
bUu U/\UUUriU/ikV3rtU U 


281 


PTTTTAPPPAPPPATTAP 


282 


PT TT TPPTTT TPPPT 77 7PT TT T 


283 


C T TT TPT TT TPPP A PP A PP 


284 


PTTTTPPA APTTPAPAPT7 
bUU bUfuiL U LnWio U 


285 


PTTT TT T AT TPPP AT TP A A A 
Lj U U UnUbuUflU U.rtrvrV 


286 


TTA A A A ATTATTTTPATTA A 
UAAAAAUAUUL.AU/Vrt. 


287 


TTA A A n PP AT TPITPTT A A 


288 


TTA A ATTATTA A APTTAPTT 


289 


TTA A <" , ATTP ,, TTP , P 1 A A A A A 

U AALAUL? UL7L7AAAAA 


290 


TTA A P^PTTP 1 AP'P^P'TTA P 1 

U AACG C U GALLiLj U AL. 


291 


TTA ATTr»Ar , TTTTTTATTr , r , r* 

U AAU CAG U U U AULjLiL-» 


292 


UAAUGAU AG U AGG AG 




▼ T TV TV TV T T/^TT TV /"I TV /"I TV I T/** 

UACAAUGUACACAUG 




T TTV T* /"T<^» /"I /"I T\ TV ^ TV TV T T 

! UACAGGCCAGACAAU 


OQC 


UAC CAC CGCUUG AGA 


D 


UACCCACAGAC C C CA 


z y / 


UACCUAAAGGAuCAA 


TOR 
<5 O 


t tt\ rinrir'TTrtT AAA/""* 
UACCUGUGUGGAAAG 


TOO 
Z J J 


UAGAGU UAGGCALjLjLt 


300 


TTA PP7\PTT7\PPB P^P^P" 1 A 

U AGGAG U AGC ALLL A 


7fn 

JvJ. 


nTiPPPTVPPPBTTZlPTTP 
U AL7L7L ALiLiLrrtU/iL. U 


302 


TTAP^TTHPA APAPPA A A 


303 


i TTAP'TTTTnPAPTTAATTA A 
UrtLiU ULTLrrtwUrtrtUrVrt 


304 


TTATTA A ATTATTA A APTTA 
U rt. U rtrtrt U rt Urtrtrto U -ri 


305 


TTATTA ATTPAPTTTTTTATTP 


306 


TTA IT A /""TT/^/^A A P7\PP3i 

UAUAGUGCAACAGLA 


7 f)7 

Jv/ 


TTATTATTA A ATTATTA A AP 
UAUAUArtrtUrtUrtrtrtVa 


308 


TT7M7P7\PPP7\ P7\ A T TT TP 1 

UAUGAGGGAC AAU U G 




TTATTA r 1 P , P , P 1 TTP , OP 1 A TTP 1 

UAUGCCCLULL.L.AUL- 


310 


T77\TTPPPPTT7\ /^/*tT T/^T TO 

UAUGGGGUACCUGUG 




TTATTTTA Af'A )\P7\P ATTP 1 

UAUU AAC AAGAGAU Lr 


71 9 


UAUUAUgGGGUACCU 


"3 11 
6 -L J 


tTTiT YtT^llitTTV T\ T tfl TV T T 7\ 

UAUUCAUAAUGAUAG 




T T Al TT TPT TPT TPP7 7 A7 7 AP 


315 


UAUUUUGUGCAUCAG 


316 


UCAACAGCUCCUAGG 


317 


UCAAUAACGCUGACG 


318 


i UCACAGUCUAUUAUG 


319 


UC ACAGUCUGGGG CA 


320 


UCAGACCUGGAGGAG 


321 
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t tp a p tit tppt ta a a a 

U v_..fivji*vJ VjV.. UrtHHljLn 


322 


ttpappapapttapaatt 

U V^>4VjV— /AL»AVjU/^V^rVrllJ 


323 


urAGUUUAUGGGAUC 


324 


TTPATTAATTP.ATTAGT7AG ! 

U V^ rtvJr^rtVJVj.ri.VJ.riVJ U.TVJ 


325 


TTPPAGGPAAGAGTTPG 


326 


TTPPPATTAPATTTTAT7I7G 


327 


T 7PPT 7GG PUGUGG AAA 


328 


T TPPT TT 7GGGT 71 7GT 71 JGG 


329 


t7CUAtTTJAUGGGGlJAC 


330 


T TPT 7GGT T AT T AGT 7GG A A 


331 


TJCTJUCAGCUACCACC 


332 


UCUUGGGAGCAGCAG 


333 


UGAAUUAUAUAAAUA 


334 


UGACGGUACAGGCCA 


335 


UGAGGGACAAUUGGA 


336 


TTPATTAGTTAGGAGGPTT 

VJ VJrl VJ/VJ U r\VJ VJ.MVJ VJ Vw U 


337 


TTPPA APTTPAPAGTTPTT 
U Vj V^firiV- VJ V«<JAV,/\\J U V^ vj 


338 


TTPPaTTPAP.ATTP.PTTA A 


33 9 


ITPPPTTPTTTTPAPPTTnP 

U vj V- U U C U U Av3 L. U/VL. 


340 


UCjCCUvjUvjUAv^L.uAU 


*id 1 
j*i 


ITPPTTPTTPPa A a RPTTP 

UvjC U L. U vjijAAAAU U u 


342 


TTPPaa a a aTTa a paTTP 

Uvjvj/A/\/\A/\Ui\/\Vw/4Uvj 


343 


TTPPAA APATTAPPTTA A 
U Vj Vj/^SJ>i.Vj/-l Ui^V— 1_ \ 


344 


T 7HP» A A PT TT TPT 7P»nR A P 

VJVJVJfxrVV.. VJ VJ V-- LJ ^JjvjVJj^t.'sw, 


345 


T TP. P. AP A AGT7G A AT TT T A 

U VJ VjiA VJi-lJ 1VJ U Vj-M-rt U LJ^\ 


346 


T TPP A PP APP AH AT TAT 7 
U VjVj^oVjrtVjVj.rtVj./-i.VJ.rt VJ 


347 


TTPPAPTTA ATTAAA7TPTT 
U vj VjjriVj U /xri U nrtrt vJ V^ U 


348 


TTPPPAPTTPTTAPPAPA 

UVjVjL~tt.VJ U Vw U-r^VjlV^JAVj/^. 


349 


TTPPPTTPTTPPA a aPATT 
UvjvjL- U Vj U vjvj>4>LH.vj^iU 


350 


TTPPPTTPTTPPTTATTATTA 
U vjvj v_ U vj U vj vj U M. Un U M. 


351 


1TPPPAPPAPPAPPAA 

UvJVJ VJ AVJ Vw~r\vj V^fVVj O/vrt. 


352 


T 7GGGGT 7 A C CUGUGUG 

VJ VJVJ VJVJ VJ Jt\ V_ V» U VJ VJ VJ VJ 


353 


UGGGU CACAGUCUAU 


354 


UGGGUUCUUGGGAGC 


355 


TTPGTTATTAGTTGPAAGA 


356 


t TPPT T AT TAT T A A A A AT T A 
U vjvj U/\ Ui-i. U ftrtfirtrt VJ f\ 


357 


T7PTTAPAPATTGGAATTTT 

VJ Vj U.rtV_/lLV_rt VJ VJVJ/-ittVj VJ 


358 


TTPTTAPPPAPAPAPPP 


359 


TTPTTATTPPPPPTTPPPA 
Uvj^AUVjv-v^v_v_UUv^V^JA 


360 


tt/^tt/^appa paPTTapa 




tt/~it t/^t Tnr'TTaTTaPT tpp 
U vj U v. U vj vj Ujn vj Avj u Vj v_ 


JO* 


TTPTTPPATTPAPATTPPTT 
U vj U vj v-rt U «wJ-ivjJ-i. vj vj v» u 


363 


T TPT TPPPT TPT TT TP AG PT 7 

UVjVJVjv*V a ,VJV~U VJV~.CVVJV^VJ 


364 


TTPTTPPA A A AAT7AACA 


365 • 


T TPT TGG A A A G At TAG CI 7 

VJ \J VJ VJVJ/i_^T-/i VJ A* vV.** V- l_y 


366 


T TPT TPP A A PT TT TPT TGGG 
UvjUvjvj/\riv^U vjv-»vjvjvj\j 


367 


T TPT TPPT TAT TAT TA A A A A 
U Vj U vjvj Ui^UM.VJ^iJT-rtJ-\i-i 


368 


UGUGUAC CCACAG AC 


369 


UGUUCCUUGGGUUCU 


370 


UGUUGCAACUCACAG 


371 


UUAACAUGUGGAAAA 


372 


UUAAGAAUAGUUUUU 


373 


UUAGGCAGGGAUACU 


374 


UUAUAUAAAUAUAAA 


375 
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UUAUGGGGUACCUGU 


376 


UUAUUGUCUGGUAUA 


377 


UUCAGAC CUGGAGGA 


378 


UUCAUAAUGAUAGUA 


379 


UUCCCAUACAUUAUU 


380 


UUCCUUGGGUUCUUG 


381 


UUCUUGGGAGCAGCA 


382 


UUGCAACUCACAGUC 


383 


UUG GAGAAGUG AAUU 


384 


UUGGAGUAAUAAAUC 


385 


UUGGGAGCAGCAGGA 


386 


UUGGGUUCUUGGGAG 


387 


UUGUCUGGUAUAGUG 


388 


UUGUGCAUCAGAUGC 


389 


UUUAACAUGUGGAAA 


390 


UUUAUGGGAUCAAAG 


391 


UUUGCUGAGGGCUAU 


! 392 


UUUGUGCAUCAGAUG 


393 


UUUUAAC AUGUG G AA 


3 94 


UUUUGUGCAUCAGAU 


395 
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Table IV: HIV env Target and Hammerhead Sequence 



Substrate 


Seq 
ID 


Hammerhead Ribozyme 


Seq 

T*n 
±u 


AAAAAUA A CAUGGUA 


1 


UACCAUG CUGAUGAGGCCGUUAGGCCGAA UAUUUUU 


505 


AAAAAUA U UCAUAAU 


2 


AUUAUGA CUGAUGAGGCCGUUAGGCCGAA UAUUUUU 


506 


AAAUAUA A AGUAGUA 


11 


UACUACU CUGAUGAGGCCGUUAGGCCGAA UAUAUUU 


507 


AAGCCUA A AGCCAUG 


20 


CAUGGCU CUGAUGAGGCCGUUAGGCCGAA UAGGCUU 


508 


AAUAUUC A UAAUGAU 


24 


AUCAUUA CUGAUGAGGCCGUUAGGCCGAA GAAUAUU 


509 


AAUGAUA G UAGGAGG 


26 


CCUCCUA CUGAUGAGGCCGUUAGGCCGAA UAUCAUU 


510 


AAUUAUA U AAAUAUA 


31 


UAUAUUU CUGAUGAGGCCGUUAGGCCGAA UAUAAUU 


511 


ACAAUUA U UGUCUGG 


36 


CCAGACA CUGAUGAGGCCGUUAGGCCGAA UAAUUGU 


512 


AGAGUUA G GCAGGGA 


61 


UCCCUGC CUGAUGAGGCCGUUAGGCCGAA UAACUCU 


513 


AGCACUA U GGGCGCA 


64 


UGCGCCC CUGAUGAGGCCGUUAGGCCGAA UAGUGCU 


514 


AGGAGUA G CACCCAC 


72 


GUGGGUG CUGAUGAGGCCGUUAGGCCGAA UACUCCU 


515 


AUAAAUA U AAAGUAG 


84 


CUACUUU CUGAUGAGGCCGUUAGGCCGAA UAUUUAU 


516 


AUCAGUU U AUGGGAU 


97 


AUCCCAU CUGAUGAGGCCGUUAGGCCGAA AACUGAU 


517 


AUUCAUA A UGAUAGU 


110 


ACUAUCA CUGAUGAGGCCGUUAGGCCGAA UAUGAAU 


518 


CAAUGUA C ACAUGGA 


121 


UCCAUGU CUGAUGAGGCCGUUAGGCCGAA UACAUUG 


519 ! 


CAAUGUA U GCCCCUC 


122 


GAGGGGC CUGAUGAGGCCGUUAGGCCGAA UACAUUG 


520 


CACAGUC U AUUAUGG 


127 


CCAUAAU CUGAUGAGGCCGUUAGGCCGAA GACUGUG 


521 


CACAGUC U GGGGCAU 


128 


AUGCCCC CUGAUGAGGCCGUUAGGCCGAA GACUGUG 


522 


! CAGUCUA U UAUGGGG 


142 


CCCCAUA CUGAUGAGGCCGUUAGGCCGAA UAGACUG 1 


523 


CAGUUUA U GGGAUCA 


143 


UGAUCCC CUGAUGAGGCCGUUAGGCCGAA UAAACUG 


524 


CCAAUUC C CAUACAU 


149 


AUGUAUG CUGAUGAGGCCGUUAGGCCGAA GAAUUGG 


525 


CCUCUUC A GCUACCA 


161 


UGGUAGC CUGAUGAGGCCGUUAGGCCGAA GAAGAGG 


526 


CUAUUUU G UGCAUCA 


174 


UGAUGCA CUGAUGAGGCCGUUAGGCCGAA AAAAUAG 


527 


CUGUGUA C CCACAGA 


186 


UCUGUGG CUGAUGAGGCCGUUAGGCCGAA UACACAG 


528 


GACAAUU A UUGUCUG 


199 


CAGACAA CUGAUGAGGCCGUUAGGCCGAA AAUUGUC 


529 


GACAAUU G GAGAAGU 


200 


ACUUCUC CUGAUGAGGCCGUUAGGCCGAA AAUUGUC 


530 


GACGGUA C AGGCCAG 


203 


CUGGCCU CUGAUGAGGCCGUUAGGCCGAA UACCGUC 


531 


GAUAGUA G GAGGCUU 


212 


AAGCCUC CUGAUGAGGCCGUUAGGCCGAA UACUAUC 


532 


GAUGCUA A AGCAUAU 


215 


AUAUGCU CUGAUGAGGCCGUUAGGCCGAA UAGCAUC 


533 


GCAACUC A CAGUCUG 


216 


CAGACUG CUGAUGAGGCCGUUAGGCCGAA GAGUUGC 


534 


GCCUCUU C AGCUACC 


226 


GGUAGCU CUGAUGAGGCCGUUAGGCCGAA AAGAGGC 


535 


GGCAGUC U AGCAGAA 


244 


UUCUGCU CUGAUGAGGCCGUUAGGCCGAA GACUGCC 


536 


GGUUCUU G GGAGCAG 


259 


CUGCUCC CUGAUGAGGCCGUUAGGCCGAA AAGAACC 


537 


GUAUAUA A AAAUAUU 


266 


AAUAUUU CUGAUGAGGCCGUUAGGCCGAA UAUAUAC 


538 


GUCAAUA A CGCUGAC 


268 


GUCAGCG CUGAUGAGGCCGUUAGGCCGAA UAUUGAC 


539 


GUCUAUU A UGGGGUA 


271 


UACCCCA CUGAUGAGGCCGUUAGGCCGAA AAUAGAC 


540 


GUGAAUU A UAUAAAU 


273 


AUUUAUA CUGAUGAGGCCGUUAGGCCGAA AAUUCAC 


541 


GUGCAUC A GAUGCUA 


274 


UAGCAUC CUGAUGAGGCCGUUAGGCCGAA GAUGCAC 


542 


GUGCCUC U UCAGCUA 


275 


UAGCUGA CUGAUGAGGCCGUUAGGCCGAA GAGGCAC 


543 


GUGGGUC A CAGUCUA 


279 


UAGACUG CUGAUGAGGCCGUUAGGCCGAA GACCCAC 


544 


GUUCCUU G GGUUCUU 


283 


AAGAACC CUGAUGAGGCCGUUAGGCCGAA AAGGAAC 


545 


UAGAGUU A GGCAGGG 


300 


CCCUGCC CUGAUGAGGCCGUUAGGCCGAA AACUCUA 


546 


UAUAAUC A GUUUAUG 


306 


CAUAAAC CUGAUGAGGCCGUUAGGCCGAA GAUUAUA 


547 


UAUUGUC U GGUAUAG 


315 


CUAUACC CUGAUGAGGCCGUUAGGCCGAA GACAAUA 


548 


UCAGUUU A UGGGAUC 


324 


GAUCCCA CUGAUGAGGCCGUUAGGCCGAA AAACUGA 


549 


UCCCAUA C AUUAUUG 


327 


CAAUAAU CUGAUGAGGCCGUUAGGCCGAA UAUGGGA 


550 


UCUAUUA U GGGGUAC 


330 


GUACCCC CUGAUGAGGCCGUUAGGCCGAA UAAUAGA 


551 


UCUGGUA U AGUGCAA 


331 


UUGCACU CUGAUGAGGCCGUUAGGCCGAA UACCAGA 


552 


UGAAUUA U AUAAAUA 


334 


UAUUUAU CUGAUGAGGCCGUUAGGCCGAA UAAUUCA 


553 


UGGAGUA A UAAAUCU 


348 


AGAUUUA CUGAUGAGGCCGUUAGGCCGAA UACUCCA 


554 
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UGGGGUA C CUGUGUG 


353 


CACACAG CUG AUGAGGC C GUUAGGC CGAA UACCCCA 


555 


UGGGUUC U UGGGAGC 


355 


GCUCCCA CUGAUGAGG CCGUUAGGCCGAA GAACCCA 


556 


UGGUAUA G UGCAACA 


356 


UGUUGCA CUGAUGAGGCCGUUAGGCCGAA UAUACCA 


557 


UGGUAUA U AAAAAUA 


357 


UAUUUUU CUGAUGAGGCCGUUAGGCCGAA UAUACCA 


558 


UGUGGUA U AUAAAAA 


368 


UUUUUAU CUGAUGAGGCCGUUAGGCCGAA UACCACA 


559 


UUAUAUA A AUAUAAA 


375 


UUUAUAU CUGAUGAGGCCGUUAGGCCGAA UAUAUAA 


560 


UUGGGUU C UUGGGAG 


387 


CUCCCAA CUGAUGAGGCCGUUAGGCCGAA AACCCAA 


561 
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Table V: HIV env Target and Inozyme Sequence 



Substrate 


C? g~\ j-r 

1JJ 




Seq 
ID 


AAAGCCA U GUGUAAA 


8 


UUUACAC CUGAUGAGGCCGUUAGGCCGAA IGGCUUU 


DOZ 


AAAGCCU A AAGCCAU 


9 


AUGGCUU CUGAUGAGGCCGUUAGGCCGAA IGGCUUU 


a a 
bb J 


AAGCACU A UGGGCGC 


19 


GCGCCCA CUGAUGAGGCCGUUAGGCCGAA IGUGCUU 


bb4 


AAUGGCA G UCUAGCA 


27 


UGCUAGA CUGAUGAGGCCGUUAGGCCGAA IGCCAUU 




AAUGUCA G CACAGUA 


30 


UACUGUG CUGAUGAGGCCGUUAGGCCGAA IGACAUU 


c c c 

ODD 


AAUUCCC A UACAUUA 


32 


UAAUGUA CUGAUGAGGCCGUUAGGCCGAA IGGAAUU 


567 


ACAGACC C CAACCCA 


39 


UGGGUUG CUGAUGAGGCCGUUAGGCCGAA IGUCUGU 


568 


ACAGGCC A GACAAUU 


40 


AAUUGUC CUGAUGAGGCCGUUAGGCCGAA IGCCUGU 


569 


ACAGUCU A UUAUGGG 


42 


CCCAUAA CUGAUGAGGCCGUUAGGCCGAA IGACUGU 


570 


ACAUGCC U GUGUACC 


43 


GGUACAC CUGAUGAGGCCGUUAGGCCGAA IGCAUGU 


571 


ACCCACA G ACCCCAA 


46 


UUGGGGU CUGAUGAGGCCGUUAGGCCGAA IGUGGGU 


572 


ACUCACA G UCUGGGG 


52 


CCCCAGA CUGAUGAGGCCGUUAGGCCGAA IGUGAGU 


573 


AGACCCC A ACCCACA 1 


57 


UGUGGGU CUGAUGAGGCCGUUAGGCCGAA IGGGUCU 


574 


AGAUGCU A AAGCAUA 


62 


UAUGCUU CUGAUGAGGCCGUUAGGCCGAA IGCAUCU 


575 


AGCAGCA G GAAGCAC 


65 


GUGCUUC CUGAUGAGGCCGUUAGGCCGAA IGCUGCU 


576 


AGCUCCA G GCAAGAG 


69 


CUCUUGC CUGAUGAGGCCGUUAGGCCGAA IGGAGCU 


577 


AGGAUCA A CAGCUCC 


73 


GGAGCUG CUGAUGAGGCCGUUAGGCCGAA IGAUCCU 


578 


AGGGACA A UUGGAGA 


76 


UCUCCAA CUGAUGAGGCCGUUAGGCCGAA IGUCCCU 


579 


AUAAUCA G UUUAUGG 


86 


CCAUAAA CUGAUGAGGCCGUUAGGCCGAA IGAUUAU 


580 


AUAUUCA U AAUGAUA 


93 


UAUCAUU CUGAUGAGGCCGUUAGGCCGAA IGAAUAU 


581 


AUCAACA G CUCCUAG 


95 


CUAGGAG CUGAUGAGGCCGUUAGGCCGAA IGUUGAU 


582 


AUGUACA C AUGGAAU 


103 


AUUCCAU CUGAUGAGGCCGUUAGGCCGAA IGUACAU 


583 


AUUAACA A GAGAUGG 


107 


CCAUCUC CUGAUGAGGCCGUUAGGCCGAA IGUUAAU 


584 


AUUCCCA U ACAUUAU 


111 


AUAAUGU CUGAUGAGGCCGUUAGGCCGAA IGGGAAU 


585 


AUUGUCU G GUAUAGU 


113 


ACUAUAC CUGAUGAGGCCGUUAGGCCGAA IGACAAU 


586 


AUUUGCU G AGGGCUA 


114 


UAGCCCU CUGAUGAGGCCGUUAGGCCGAA IGCAAAU 


587 


CAAAGCC U AAAGCCA 


118 


UGGCUUU CUGAUGAGGCCGUUAGGCCGAA IGCUUUG 


588 


CAACUCA C AGUCUGG 


119 


CCAGACU CUGAUGAGGCCGUUAGGCCGAA IGAGUUG 


589 


CAAUUCC C AUACAUU 


123 


AAUGUAU CUGAUGAGGCCGUUAGGCCGAA IGAAUUG 


590 


CAGACCC C AACCCAC 


132 


GUGGGUU CUGAUGAGGCCGUUAGGCCGAA IGGUCUG 


591 


CAGACCU G GAGGAGG 


133 


CCUCCUC CUGAUGAGGCCGUUAGGCCGAA IGGUCUG 


592 


CAGCACA G UACAAUG 


135 


CAUUGUA CUGAUGAGGCCGUUAGGCCGAA IGUGCUG 


593 


CAGCUCC A GGCAAGA 


137 


UCUUGCC CUGAUGAGGCCGUUAGGCCGAA IGAGCUG 


594 ; 


CAGGCCA G ACAAUUA 


140 


UAAUUGU CUGAUGAGGCCGUUAGGCCGAA IGGCCUG 


595 


CAGUACA A UGUACAC 


141 


GUGUACA CUGAUGAGGCCGUUAGGCCGAA IGUACUG 


596 


CAUGCCU G UGUACCC 


147 


GGGUACA CUGAUGAGGCCGUUAGGCCGAA IGGCAUG 


597 


CCAGACA A UUAUUGU 


152 


ACAAUAA CUGAUGAGGCCGUUAGGCCGAA IGUCUGG 


598 


CCAGGCA A GAGUCCU 


153 


AGGACUC CUGAUGAGGCCGUUAGGCCGAA IGCCUGG 


599 


CCAUACA U UAUUGUG 


154 


CACAAUA CUGAUGAGGCCGUUAGGCCGAA IGUAUGG 


600 


CCCAACC C ACAAGAA 


155 


UUCUUGU CUGAUGAGGCCGUUAGGCCGAA IGUUGGG 


601 


CCUGGCU G UGGAAAG 


163 


CUUUCCA CUGAUGAGGCCGUUAGGCCGAA IGCCAGG 


602 


CGGUACA G GCCAGAC 


169 


GUCUGGC CUGAUGAGGCCGUUAGGCCGAA IGUACCG 


603 


CUCUUCA G CUACCAC 


178 


GUGGUAG CUGAUGAGGCCGUUAGGCCGAA IGAAGAG 


604 


CUGUGCC U CUUCAGC 


183 


GCUGAAG CUGAUGAGGCCGUUAGGCCGAA IGCACAG 


605 


GACCCCA A CCCACAA 


201 


UUGUGGG CUGAUGAGGCCGUUAGGCCGAA IGGGGUC 


606 


GGAAGCA C UAUGGGC 


235 


GCCCAUA CUGAUGAGGCCGUUAGGCCGAA IGCUUCC 


607 


GGAGCCU G UGCCUCU 


239 


AGAGGCA CUGAUGAGGCCGUUAGGCCGAA IGGCUCC 


608 


GGGAGCA G CAGGAAG 


248 


CUUCCUG CUGAUGAGGCCGUUAGGCCGAA IGCUCCC 


609 


GGGGACC C GACAGGC 


249 


GCCUGUC CUGAUGAGGCCGUUAGGCCGAA IGUCCCC 


610 


GGGUACC U GUGUGGA 


251 


UCCACAC CUGAUGAGGCCGUUAGGCCGAA IGUACCC 


611 



100 



WO 03/102131 



PCT/US03/12626 



GGGUUCU U GGGAGCA 


253 


UGCUCCC CUGAUGAGGCCGUUAGGCCGAA IGAACCC 


612 


GGUACCU G UGUGGAA 


255 


UUCCACA CUGAUGAGGCCGUUAGGCCGAA IGGUACC 


613 


GGUCACA G UCUAUUA 


258 


UAAUAGA CUGAUGAGGCCGUUAGGCCGAA IGUGACC 


614 


GUACACA U GGAAUUA 


261 


UAAUUCC CUGAUGAGGCCGUUAGGCCGAA IGUGUAC 


615 


nil AC CCA C AGACCCC 


263 


GGGGUCU CUGAUGAGGCCGUUAGGCCGAA IGGGUAC 


616 


niTATTrtrp c cucccau 


267 


AUGGGAG CUGAUGAGGCCGUUAGGCCGAA IGCAUAC 


617 


GUCAGCA C AGUACAA 


270 


UUGUACU CUGAUGAGGCCGUUAGGCCGAA IGCUGAC 


618 


GTTGUACC C ACAGACC 


281 


GGUCUGU CUGAUGAGGCCGUUAGGCCGAA IGUACAC 


619 


UAAAG CC A UGUGUAA 


288 


UUACACA CUGAUGAGGCCGUUAGGCCGAA IGCUUUA 


620 


TTAACGCU G ACGGUAC 


291 


GUACCGU CUGAUGAGGCCGUUAGGCCGAA IGCGUUA 


621 


TTAPCACC G CUUGAGA 


296 


UCUCAAG CUGAUGAGGCCGUUAGGCCGAA IGUGGUA 


622 


UAGUGCA A CAGCAAA 


303 


UUUGCUG CUGAUGAGGCCGUUAGGCCGAA IGCACUA 


623 


UAUGCCC C UCCCAUC 


310 


GAUGGGA CUGAUGAGGCCGUUAGGCCGAA IGGCAUA 


624 


UCAGACC U GGAGGAG 


321 


CUCCUCC CUGAUGAGGCCGUUAGGCCGAA IGUCUGA 


625 


UGCAACU C ACAGUCU 


338 


AGACUGU CUGAUGAGGCCGUUAGGCCGAA IGUUGCA 


626 


UGCAUCA G AUGCUAA 


339 


UUAGCAU CUGAUGAGGCCGUUAGGCCGAA IGAUGCA 


627 


UGCCUCU U CAGCUAC 


340 


GUAGCUG CUGAUGAGGCCGUUAGGCCGAA IGAGGCA 


628 


UGGAACU U CUGGGAC 


345 


GUCCCAG CUGAUGAGGCCGUUAGGCCGAA IGUUCCA 


629 


UGGGUCA C AGUCUAU 


354 


AUAGACU CUGAUGAGGCCGUUAGGCCGAA IGACCCA 


630 


UGUACCC A CAGACCC 


359 


GGGUCUG CUGAUGAGGCCGUUAGGCCGAA IGGUACA 


631 


UGUGCCU C UUCAGCU 


364 


AGCUGAA CUGAUGAGGCCGUUAGGCCGAA IGGCACA 


632 


UGUUCCU U GGGUUCU 


370 


AGAACCC CUGAUGAGGCCGUUAGGCCGAA IGGAACA 


633 


UGUUGCA A CUCACAG 


371 


CUGUGAG CUGAUGAGGCCGUUAGGCCGAA IGCAACA 


634 


UUAGGCA G GGAUACU 


374 


AGUAUCC CUGAUGAGGCCGUUAGGCCGAA IGCCUAA 


635 


UUGUGCA U CAGAUGC 


389 


GCAUCUG CUGAUGAGGCCGUUAGGCCGAA IGCACAA 


636 


UUUAACA U GUGGAAA 


390 


UUUCCAC CUGAUGAGGCCGUUAGGCCGAA IGUUAAA 


637 
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Table VI: fflV env Target and G-eleaver Sequence 



Substrate 


Seq 
ID 


G-Cieaver KiJoozyme 


ID 


AACGCUG A CGGUACA 


14 


UGUACCG UGAUG GCAUGCACUAUGC GCG CAGCGUU 


C "5 Q 
OJO 


AAUAACG C UGACGGU 


22 


ACCGUCA UGAUG GCAUGCACUAUGC GCG CGUUAUU 


b jy 


ACACAUG C CUGUGUA 


38 


UACACAG UGAUG GCAUGCACUAUGC GCG CAUGUGU 


C A f\ 


ACCACCG C UUGAGAG 


45 


CUCUCAA UGAUG GCAUGCACUAUGC GCG CGGUGGU 


CA 1 


ACCUGUG U GGAAAGA 


49 


UCUUUCC UGAUG GCAUGCACUAUGC GCG CACAGGU 


C A O 


AGAAGUG A AUUAUAU 


55 


AUAUAAU UGAUG GCAUGCACUAUGC GCG CACUUCU 




AGCAAUG U AUGCCCC 


63 


GGGGCAU UGAUG GCAUGCACUAUGC GCG CAUUGCU 


644 


AUGCCUG U GUACCCA 


100 


UGGGUAC UGAUG GCAUGCACUAUGC GCG CAGGCAU 


CA c 


AUGUAUG C CCCUCCC 


104 


GGGAGGG UGAUG GCAUGCACUAUGC GCG CAUACAU 


C A C 


CAAUUUG C UGAGGGC 


125 


GCCCUCA UGAUG GCAUGCACUAUGC GCG CAAAUUG 


CA "7 


CAUAAUG A UAGUAGG 


144 


CCUACUA UGAUG GCAUGCACUAUGC GCG CAUUAUG 


O *x O 


CCGCUUG A GAGACUU 


159 


AAGUCUC UGAUG GCAUGCACUAUGC GCG CAAGCGG 


649 


CUGGCUG U GGAAAGA 


181 


UCUUUCC UGAUG GCAUGCACUAUGC GCG CAGCCAG 


650 


GAGCCUG U GCCUCUU 


208 


AAGAGGC UGAUG GCAUGCACUAUGC GCG CAGGCUC 


651 


GCCUGUG C CUCUUCA 


227 


UGAAGAG UGAUG GCAUGCACUAUGC GCG CACAGGC 


652 


GCCUGUG U ACCCACA 


228 


UGUGGGU UGAUG GCAUGCACUAUGC GCG CACAGGC 


653 


GUACCUG U GUGGAAA 


264 


UUUCCAC UGAUG GCAUGCACUAUGC GCG CAGGUAC 


654 


UAACAUG U GGAAAAA 


290 


UUUUUCC UGAUG GCAUGCACUAUGC GCG CAUGUUA 


655 


UACAAUG U ACACAUG 


294 


CAUGUGU UGAUG GCAUGCACUAUGC GCG CAUUGUA 


656 


UAUAGUG C AACAGCA 


307 


UGCUGUU UGAUG GCAUGCACUAUGC GCG CACUAUA 


657 


UAUUUUG U GCAUCAG 


316 


CUGAUGC UGAUG GCAUGCACUAUGC GCG CAAAAUA 


658 


UCAGAUG C UAAAGCA 


322 


UGCUUUA UGAUG GCAUGCACUAUGC GCG CAUCUGA 


659 


UUUGCUG A GGGCUAU 


392 


AUAGCCC UGAUG GCAUGCACUAUGC GCG CAGCAAA 


660 


UUUUGUG C AUCAGAU 


395 


AUCUGAU UGAUG GCAUGCACUAUGC GCG CACAAAA | 661 
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Table VII: HIV env Target and Zinzyme Sequence 



Substrate 


— — r 

Seq 
ID 




Seq 
ID 


AAUAACG C UGACGGU 


22 


ACCGUCA GCCGAAAGGCGAGUGAGGUCU CGUUAUU 


C CO 

b b z 


AAUAGAG U UAGGCAG 


23 


CUGCCUA GCCGAAAGGCGAGUGAGGUCU CUCUAUU 


bb j | 


ACACAUG C CUGUGUA 


38 


UACACAG GCCGAAAGGCGAGUGAGGUCU CAUGUGU 


CCA 


ACCACCG C UUGAGAG 


45 


CUCUCAA GCCGAAAGGCGAGUGAGGUCU CGGUGGU 


c c c 
bob 


ACCUGUG U GGAAAGA 


49 


UCUUUCC GCCGAAAGGCGAGUGAGGUCU CACAGGU 


666 


AGCAAUG U AUGCCCC 


63 


GGGGCAU GCCGAAAGGCGAGUGAGGUCU CAUUGCU 


667 


AGUUAGG C AGGGAUA 


80 


UAUCCCU GCCGAAAGGCGAGUGAGGUCU CCUAACU 


668 


AUGAUAG U AGGAGGC 


99 


GCCUCCU GCCGAAAGGCGAGUGAGGUCU CUAUCAU 


669 


AUGCCUG U GUACCCA 


100 


UGGGUAC GCCGAAAGGCGAGUGAGGUCU CAGGCAU 


670 | 


AUGGCAG U CUAGCAG 


101 


CUGCUAG GCCGAAAGGCGAGUGAGGUCU CUGCCAU 


671 


AUGUAUG C CCCUCCC 


104 


GGGAGGG GCCGAAAGGCGAGUGAGGUCU CAUACAU 


672 


AUGUCAG C ACAGUAC 


105 


GUACUGU GCCGAAAGGCGAGUGAGGUCU CUGACAU 


673 


CAAUUUG C UGAGGGC 


125 


GCCCUCA GCCGAAAGGCGAGUGAGGUCU CAAAUUG 


674 


CAGGAAG C ACUAUGG 


138 


CCAUAGU GCCGAAAGGCGAGUGAGGUCU CUUCCUG 


675 


CCUAAAG C CAUGUGU 


160 


ACACAUG GCCGAAAGGCGAGUGAGGUCU CUUUAGG 


676 


CCUUGGG U UCUUGGG 


166 


CCCAAGA GCCGAAAGGCGAGUGAGGUCU CCCAAGG 


677 


CUCACAG U CUGGGGC 


175 


GCCCCAG GCCGAAAGGCGAGUGAGGUCU CUGUGAG 


678 


CUCCAGG C AAGAGUC 


176 


GACUCUU GCCGAAAGGCGAGUGAGGUCU CCUGGAG 


679 


CUGACGG U ACAGGCC 


179 


GGCCUGU GCCGAAAGGCGAGUGAGGUCU CCGUCAG 


680 


CUGGCUG U GGAAAGA 


181 


UCUUUCC GCCGAAAGGCGAGUGAGGUCU CAGCCAG 


681 


CUUUGAG C CAAUUCC 


191 


GGAAUUG GCCGAAAGGCGAGUGAGGUCU CUCAAAG 


682 


GAGCCUG U GCCUCUU 


208 


AAGAGGC GCCGAAAGGCGAGUGAGGUCU CAGGCUC 


683 


GCAAGAG U CCUGGCU 


217 


AGCCAGG GCCGAAAGGCGAGUGAGGUCU CUCUUGC 


684 


GCCUGUG C CUCUUCA 


227 


UGAAGAG GCCGAAAGGCGAGUGAGGUCU CACAGGC 


685 


GCCUGUG U ACCCACA 


228 


UGUGGGU GCCGAAAGGCGAGUGAGGUCU CACAGGC 


686 


GCUGUGG U AUAUAAA 


233 


UUUAUAU GCCGAAAGGCGAGUGAGGUCU CCACAGC 


687 


GGAGAAG U GAAUUAU 


237 


AUAAUUC GCCGAAAGGCGAGUGAGGUCU CUUCUCC 


688 


GGAGCAG C AGGAAGC 


, 238 


GCUUCCU GCCGAAAGGCGAGUGAGGUCU CUGCUCC 


689 


GGUAUAG U GCAACAG 


256 


CUGUUGC GCCGAAAGGCGAGUGAGGUCU CUAUACC 


690 


GUACAGG C CAGACAA 


262 


UUGUCUG GCCGAAAGGCGAGUGAGGUCU CCUGUAC 


691 


GUACCUG U GUGGAAA 


264 


UUUCCAC GCCGAAAGGCGAGUGAGGUCU CAGGUAC 


692 


GUCACAG U CUAUUAU 


269 


AUAAUAG GCCGAAAGGCGAGUGAGGUCU CUGUGAC 


693 


UAACAUG U GGAAAAA 


290 


UUUUUCC GCCGAAAGGCGAGUGAGGUCU CAUGUUA 


694 


UAAUCAG U UUAUGGG 


292 


CCCAUAA GCCGAAAGGCGAGUGAGGUCU CUGAUUA 


695 


UACAAUG U ACACAUG 


294 


CAUGUGU GCCGAAAGGCGAGUGAGGUCU CAUUGUA 


696 


UAUAGUG C AACAGCA 


307 


UGCUGUU GCCGAAAObLCjAuUbAvjoUCU laluaua 


697 


UAUGGGG U ACCUGUG 


311 


CACAGGU GCCGAAAGGCGAGUGAGGUCU CCCCAUA 


698 


UAUUUUG U GCAUCAG 


316 


CUGAUGC GCCGAAAGGCGAGUGAGGUCU CAAAAUA 


699 


UCAACAG C UCCUAGG 


317 


CCUAGGA GCCGAAAGGCGAGUGAGGUCU CUGUUGA 


700 


UCAGAUG C UAAAGCA 


322 


UGCUUUA GCCGAAAGGCGAGUGAGGUCU CAUCUGA 


701 


UCUUCAG C UACCACC 


1 332 


GGUGGUA GCCGAAAGGCGAGUGAGGUCU CUGAAGA 


702 


UGUCUGG U AUAGUGC 


362 


GCACUAU GCCGAAAGGCGAGUGAGGUCU CCAGACA 


703 


UUGGGAG C AGCAGGA 


386 


UCCUGCU GCCGAAAGGCGAGUGAGGUCU CUCCCAA 


704 


UUUUGUG C AUCAGAU 


395 


AUCUGAU GCCGAAAGGCGAGUGAGGUCU CACAAAA 


705 
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Table VIII: fflV env Target and DNAzyme Sequence 



Substrate 


ID 




Sea 
ID 


AAAAAUA U UCAUAAU 


2 ! 


ATTATGA GGCTAGCTACAACGA TATTTTT 


706 


AAAAGAA U GAACAAG 


3 


CTTGTTC GGCTAGCTACAACGA TTCTTTT 


707 


AAAAUAA C AUGGUAG 


4 


CTACCAT GGCTAGCTACAACGA TTATTTT 


708 


AAAGCCA U GUGUAAA 


8 


TTTACAC GGCTAGCTACAACGA TGGCTTT 


709 


AACAUGA C CUGGAUG 


12 


CATCCAG GGCTAGCTACAACGA TCATGTT 


710 


AAGUGAA U UAUAUAA 


21 


TTATATA GGCTAGCTACAACGA TTCACTT 


711 


AAUAACG C UGACGGU 


22 


ACCGTCA GGCTAGCTACAACGA CGTTATT 


712 


AAUAGAG U UAGGCAG 


23 


CTGCCTA GGCTAGCTACAACGA CTCTATT 


713 


AAUUAUA U AAAUAUA 


31 


TATATTT GGCTAGCTACAACGA TATAATT 


714 


ACAAUUA U UGUCUGG 


36 


CCAGACA GGCTAGCTACAACGA TAATTGT 


715 


ACACAUG C CUGUGUA 


38 


TACACAG GGCTAGCTACAACGA CATGTGT 


716 


ACCACCG C UUGAGAG 


45 


CTCTCAA GGCTAGCTACAACGA CGGTGGT 


717 


ACCCCAA C CCACAAG 


47 


CTTGTGG GGCTAGCTACAACGA TTGGGGT 


718 


ACCUGUG U GGAAAGA 


49 


TCTTTCC GGCTAGCTACAACGA CACAGGT 


719 


ACGCUGA C GGUACAG 


50 


CTGTACC GGCTAGCTACAACGA TCAGCGT 


720 


AGCAAUG U AUGCCCC 


63 


GGGGCAT GGCTAGCTACAACGA CATTGCT 


721 


AGCACUA U GGGCGCA 


64 


TGCGCCC GGCTAGCTACAACGA TAGTGCT 


722 


AGUACAA U GUACACA 


77 


TGTGTAC GGCTAGCTACAACGA TTGTACT 


723 


AGUUAGG C AGGGAUA 


80 


TATCCCT GGCTAGCTACAACGA CCTAACT 


724 


AUAAAAA U AUUCAUA 


83 


TATGAAT GGCTAGCTACAACGA TTTTTAT 


725 


AUAAAUA U AAAGUAG 


84 


CTACTTT GGCTAGCTACAACGA TATTTAT 


726 


AUAAUGA U AGUAGGA 


87 


TCCTACT GGCTAGCTACAACGA TCATTAT 


727 


AUAUAAA U AUAAAGU 


90 


ACTTTAT GGCTAGCTACAACGA TTTATAT 


728 


AUAUUCA U AAUGAUA 


93 


TATCATT GGCTAGCTACAACGA TGAATAT 


729 


AUGAUAG U AGGAGGC 


99 


GCCTCCT GGCTAGCTACAACGA CTATCAT 


730 


AUGCCUG U GUACCCA 


100 


TGGGTAC GGCTAGCTACAACGA CAGGCAT 


731 


AUGGCAG U CUAGCAG 


101 


CTGCTAG GGCTAGCTACAACGA CTGCCAT 


732 


AUGUACA C AUGGAAU 


103 


ATTCCAT GGCTAGCTACAACGA TGTACAT 


733 


AUGUAUG C CCCUCCC 


104 


GGGAGGG GGCTAGCTACAACGA CATACAT 


734 


AUGUCAG C ACAGUAC 


105 


GTACTGT GGCTAGCTACAACGA CTGACAT 


735 


AUUCCCA U ACAUUAU 


111 


ATAATGT GGCTAGCTACAACGA TGGGAAT 


736 


AUUUUAA C AUGUGGA 


115 


TCCACAT GGCTAGCTACAACGA TTAAAAT 


737 


CAACUCA C AGUCUGG 


119 


CCAGACT GGCTAGCTACAACGA TGAGTTG 


738 


CAAUGUA C ACAUGGA 


121 


TCCATGT GGCTAGCTACAACGA TACATTG 


739 


CAAUGUA U GCCCCUC 


122 


GAGGGGC GGCTAGCTACAACGA TACATTG 


740 


CAAUUUG C UGAGGGC 


125 


GCCCTCA GGCTAGCTACAACGA CAAATTG 


741 


CAGACAA U UAUUGUC 


131 


GACAATA GGCTAGCTACAACGA TTGTCTG 


742 


CAGGAAG C ACUAUGG 


138 


CCATAGT GGCTAGCTACAACGA CTTCCTG 


743 


CAGUCUA U UAUGGGG 


142 


CCCCATA GGCTAGCTACAACGA TAGACTG 


744 


CAGUUUA U GGGAUCA 


143 


TGATCCC GGCTAGCTACAACGA TAAACTG 


745 


CAUCAGA U GCUAAAG 


146 


CTTTAGC GGCTAGCTACAACGA TCTGATG 


746 


CCACAGA C CCCAACC 


150 


GGTTGGG GGCTAGCTACAACGA TCTGTGG 


747 


CCAUACA U UAUUGUG 


154 


CACAATA GGCTAGCTACAACGA TGTATGG 


748 


CCUAAAG C CAUGUGU 


160 


ACACATG GGCTAGCTACAACGA CTTTAGG 


749 


CCUUGGG U UCUUGGG 


166 


CCCAAGA GGCTAGCTACAACGA CCCAAGG 


750 


CUCACAG U CUGGGGC 


175 


GCCCCAG GGCTAGCTACAACGA CTGTGAG 


751 


CUCCAGG C AAGAGUC 


176 


GACTCTT GGCTAGCTACAACGA CCTGGAG 


752 


CUGACGG U ACAGGCC 


179 


GGCCTGT GGCTAGCTACAACGA CCGTCAG 


753 


CUGGCUG U GGAAAGA 


181 


TCTTTCC GGCTAGCTACAACGA CAGCCAG 


754 


CUGUGUA C CCACAGA 


186 


TCTGTGG GGCTAGCTACAACGA TACACAG 


755 
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CUUCAGA C CUGGAGG 


187 


CCTCCAG GGCTAGCTACAACGA TCTGAAG 


756 


CUUUGAG C CAAUUCC 




GGAATTG GGCTAGCTACAACGA CTCAAAG 


757 


GACGGUA C AGGCCAG 


203 


CTGGCCT GGCTAGCTACAACGA TACCGTC 


758 


GAGCCUG U GCCUCUU 


208 


AAGAGGC GGCTAGCTACAACGA CAGGCTC 


759 


GAUAUAA U CAGUUUA 


213 


TAAACTG GGCTAGCTACAACGA TTATATC 


760 


GCAAGAG U CCUGGCU 


217 


AGCCAGG GGCTAGCTACAACGA CTCTTGC 


761 


GCAGGGA U ACUCACC 


222 


GGTGAGT GGCTAGCTACAACGA TCCCTGC 


762 


GCCUGUG C CUCUUCA 


227 


TGAAGAG GGCTAGCTACAACGA CACAGGC 


763 


GCCUGUG U ACCCACA I 228 


TGTGGGT GGCTAGCTACAACGA CACAGGC 


764 


GCUGUGG U AUAUAAA 


233 


TTTATAT GGCTAGCTACAACGA CCACAGC 


765 


GGAAAAA U AACAUGG 


234 


CCATGTT GGCTAGCTACAACGA TTTTTCC 


766 


GGAAGCA C UAUGGGC 


235 


GCCCATA GGCTAGCTACAACGA TGCTTCC 


767 


GGAGAAG U GAAUUAU 


237 


ATAATTC GGCTAGCTACAACGA CTTCTCC 


768 


GGAGCAG C AGGAAGC 


238 


GCTTCCT GGCTAGCTACAACGA CTGCTCC 


769 


GGAUCAA C AGCUCCU 


241 


AGGAGCT GGCTAGCTACAACGA TTGATCC 


770 


GGGACAA U UGGAGAA 


247 


TTCTCCA GGCTAGCTACAACGA TTGTCCC 


771 


GGUAUAG U GCAACAG 


256 


CTGTTGC GGCTAGCTACAACGA CTATACC 


772 


GUACACA U GGAAUUA 


261 


TAATTCC GGCTAGCTACAACGA TGTGTAC 


773 


GUACAGG C CAGACAA 


262 


TTGTCTG GGCTAGCTACAACGA CCTGTAC 


774 


GUACCCA C AGACCCC 


263 


GGGGTCT GGCTAGCTACAACGA TGGGTAC 


775 


GUACCUG U GUGGAAA 


264 


TTTCCAC GGCTAGCTACAACGA CAGGTAC 


776 


GUCACAG U CUAUUAU 


269 


ATAATAG GGCTAGCTACAACGA CTGTGAC 


777 


GUCAGCA C AGUACAA 


270 


TTGTACT GGCTAGCTACAACGA TGCTGAC 


778 


GUUGCAA C UCACAGU 


285 


ACTGTGA GGCTAGCTACAACGA TTGCAAC 


779 


UAACAUG U GGAAAAA 


290 


TTTTTCC GGCTAGCTACAACGA CATGTTA 


780 


UAAUCAG U UUAUGGG 


292 


rrraTaa GGCTAGCTACAACGA CTGATTA 


781 


UACAAUG U ACACAUG 


294 


CATGTGT GGCTAGCTACAACGA CATTGTA 


782 


UAUAGUG C AACAGCA 


307 


TGCTGTT GGCTAGCTACAACGA CACTATA 


783 


UAUGGGG U ACCUGUG 


311 


rananfiT GGCTAGCTACAACGA CCCCATA 


784 


UAUUUUG U GCAUCAG 


316 


PTHATGC GGCTAGCTACAACGA CAAAATA 


785 


UCAACAG C UCCUAGG 


317 


pgtagga GGCTAGCTACAACGA CTGTTGA 


786 


UCAAUAA C GCUGACG 


318 


CGTCAGC GGCTAGCTACAACGA TTATTGA 


787 


UCAGAUG C UAAAGCA 


322 


TGCTTTA GGCTAGCTACAACGA CATCTGA 


788 


UCCCAUA C AUUAUUG 


327 


CAATAAT GGCTAGCTACAACGA TATGGGA 


789 


UCUAUUA U GGGGUAC 


330 


GTACCCC GGCTAGCTACAACGA TAATAGA 


790 


UCUGGUA U AGUGCAA 


331 


TTGCACT GGCTAGCTACAACGA TACCAGA 


791 


UCUUCAG C UACCACC 


332 


GGTGGTA GGCTAGCTACAACGA CTGAAGA 


792 


UGAAUUA U AUAAAUA 


334 


TATTTAT GGCTAGCTACAACGA TAATTCA 


793 


UGAGGGA C AAUUGGA 


336 


TCCAATT GGCTAGCTACAACGA TCCCTCA 


794 


UGGGGUA C CUGUGUG 


353 


CACACAG GGCTAGCTACAACGA TACCCCA 


795 


UGGGUCA C AGUCUAU 


354 


AT AG ACT GGCTAGCTACAACGA TGACCCA 


796 


UGGUAUA U AAAAAUA 


357 


TATTTTT GGCTAGCTACAACGA TATACCA 


797 


UGUCUGG U AUAGUGC 


362 


GCACTAT GGCTAGCTACAACGA CCAGACA 


798 


UGUGGAA C UUCUGGG 


367 


pre AG A A GGCTAGCTACAACGA TTCCACA 


799 


UGUGGUA U AUAAAAA 


368 


TTTTTAT GGCTAGCTACAACGA TACCACA 


800 


UUAAGAA U AGUUUUU 


373 


AAAAACT GGCTAGCTACAACGA TTCTTAA 


801 


UUCAUAA U GAUAGUA 


379 


TACTATC GGCTAGCTACAACGA TTATGAA 


802 


UUGGGAG C AGCAGGA 


386 


TCCTGCT GGCTAGCTACAACGA CTCCCAA 


803 


UUGUGCA U CAGAUGC 


389 


GCATCTG GGCTAGCTACAACGA TGCACAA 


804 


UUUAACA U GUGGAAA 


390 


TTTCCAC GGCTAGCTACAACGA TGTTAAA 


805 


UUUUGUG C AUCAGAU 


395 


ATCTGAT GGCTAGCTACAACGA CACAAAA 


806 
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Table X: HIV env Target and Antisense Sequence 



Sequence 


Seq ID 


Aiicisense 


Oc4 A.XJ 


CAGCAGGAAGCACUAUGGGCG 


396 


CGCCCATAGTGCTTCCTGCTG 


906 


AGCAGGAAGCACUAUGGGCGC 


397 


GCGCCCATAGTGCTTCCTGCT 


907 


GCAGCAGGAAGCACUAUGGGC 


398 


GCCCATAGTGCTTCCTGCTGC 


908 


AGCAG C AGG AAGCACUAUGGG 


399 


CCCATAGTGCTTCCTGCTGCT 


909 


GAGCAGCAGGAAGCACUAUGG 


400 


CCATAGTGCTTCCTGCTGCTC 


910 


GGAG C AG C AGGAAG CACUAUG 


401 


CATAGTGCTTCCTGCTGCTCC 


911 


CGCUGACGGUACAGGC C AGAC 


402 


GTCTGGCCTGTACCGTCAGCG 


912 


ACAAUUG GAGAAGUGAAUUAU 


403 


ATAATTCACTTCTCCAATTGT 


913 


ACGCUG ACGGUACAGG CCAGA 


404 


TCTGGCCTGTACCGTCAGCGT 


914 


AGUUAGGCAGGGAUACUCACC 


405 


GGTGAGTATCCCTGCCTAACT 


915 


CAAUUGGAGAAGUGAAUUAUA 


406 


TATAATTCACTTCTCCAATTG 


916 


GAGUUAGGCAGGGAUACUCAC 


407 


GTGAGTATCCCTGCCTAACTC 


917 


AGAGUUAGGCAGGGAUACUCA 


408 


TGAGTATCCCTGCCTAACTCT 


918 


AUUGGAGAAGUGAAUUAUAUA 


409 


TATATAATTCACTTCTCCAAT 


919 


AAUUGGAGAAGUGAAUUAUAU 


410 


ATATAATTCACTTCTCCAATT 


920 


GACAAUUGGAGAAGUGAAUUA 


411 


TAATTCACTTCTCCAATTGTC 


921 


XJUGGAGAAGUGAAUUAUAUAA 


412 


TTATATAATTCACTTCTCCAA 


922 


UAGAGUUAGG C AGGGAUACUC 


413 


GAGTATCCCTGCCTAACTCTA 


923 


UGCCUGUGUACCCACAGACCC 


414 


GGGTCTGTGGGTACACAGGCA 


924 


AUGCCUGUGUACCCACAGACC 


415 


GGTCTGTGGGTACACAGGCAT 


925 


AUAGAGUUAGGCAGGGAUACU 


416 


AGTATCCCTGCCTAACTCTAT 


926 


CAUGCCUGUGUACCCACAGAC 


417 


GTCTGTGGGTAC ACAGG CATG 


927 


AAUAGAGUUAGGCAGGGAUAC 


418 


GTATCCCTGCCTAACTCTATT 


928 


ACACAUGCCUGUGUACCCACA 


419 


TGTGGGTACACAGGCATGTGT 


929 


CACAUGCCUGUGUACCCACAG 


420 


CTGTGGGTACACAGGCATGTG 


930 


ACAUGCCUGUGUACCCACAGA 


421 


TCTGTGGGTACACAGGCATGT 


931 


GGACAAUUGGAGAAGUGAAUU 


422 


AATTCACTTCTCCAATTGTCC 


932 


AGCAAUGUAUGCCCCUCCCAU 


423 


ATGGGAGGGGC ATACATTG C T 


933 


GCUGACGGUACAGGC CAGACA 


424 


TGTCTGGCCTGTACCGTCAGC 


934 


GCCUGUGUACCCACAGACCCC 


425 


GGGGTCTGTGGGTACACAGGC 


935 


UAUUAUGGGGUACCUGUGUGG 


426 


CCACACAGGTAC CCC AT AAT A 


936 


GCUCCAGGCAAGAGUCCUGGC 


427 


GCCAGGACTCTTGCCTGGAGC 


937 


CAGCUCCAGGCAAGAGUCCUG 


428 


CAGGACTCTTGCCTGGAGCTG 


938 


AGCUCCAGGCAAGAGUCCUGG 


429 


CCAGGACTCTTGCCTGGAGCT 


939 


CUC CAGGCAAGAGUCCUGGCU 


430 


AGCCAGGACTCTTGCCTGGAG 


940 


CCUGUGUACCCACAGACCCCA 


431 


TGGGGTCTGTGGGTACACAGG 


941 


CUGACGGUACAGGCCAGACAA 


432 


TTGTCTGGCCTGTACCGTCAG 


942 


CCAAUUCCCAUACAUUAUUGU 


433 


ACAATAATGTATGGGAATTGG 


943 


AUUAUGGGGUACCUGUGUGGA 


434 


TC CACACAGGTACC CC AT AAT 


944 


UACCCACAGACCCCAACCCAC 


435 


GTGGGTTGGGGTCTGTGGGTA 


945 


UGUCUGGUAUAGUG C AAC AG C 


436 


GCTGTTGCACTATACCAGACA 


946 


CUUGGGAGC AG GAG GAAGCAC 


437 


GTGCTTCCTGCTGCTCCCAAG 


947 


U CUUGGGAGCAGC AGGAAGC A 


438 


TGCTTCCTGCTGCTCCCAAGA 


948 


GUCUGGUAUAGUGCAACAGCA 


439 


TGCTGTTGCACTATACCAGAC 


949 


GUACCCACAGACCCCAACCCA 


440 


TGGGTTGGGGTCTGTGGGTAC 


950 


UUCUUGGGAGCAGCAGGAAGC 


441 


GCTTCCTGCTGCTCCCAAGAA 


951 


UGACGGUACAGGCCAGACAAU 


442 


ATTGTCTGGCCTGTACCGTCA 


952 


UGGCUGUGGUAUAUAAAAAUA 


443 


TATTTTTATATACCACAGCCA 


953 


UGUGCCUCUUCAGCUACCACC 


444 


GGTGGTAGCTGAAGAGGCACA 


i 954 


G ACGGUACAGG C CAGAC AAUU 


445 


AATTGTCTGGCCTGTACCGTC 


955 


UGUGUACCCACAGACCCCAAC 


446 


GTTGGGGTCTGTGGGTACACA 


956 
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UGGGGUACCUGUGUGGAAAGA 


447 


TCTTTCCACACAGGTACCCCA 


957 


GUGUACCCACAGACCCCAACC 


448 


GGTTGGGGTCTGTGGGTACAC 


958 


UAUGGGGUACCUGUGUGGAAA 


449 


TTTCCAC ACAGGTAC CCCATA 


959 


AUGGGGUACCUGUGUGGAAAG 


450 


CTTTCCACACAGGTACCCCAT 


960 


GGCUGUGGUAUAUAAAAAUAU 


451 


AT ATTTTT ATATACCAC AG CC 


961 


UGUACCCACAGACCCCAACCC 


452 


GGGTTGGGGTCTGTGGGTACA 


962 


CCCACAGACCCCAACCCACAA 


453 


TTGTGGGTTGGGGTCTGTGGG 


963 


CUGUGCCUCUUCAGCUACCAC 


454 


GT GGT AG C TGAAG AGG C ACAG 


964 


GCUGUGGUAUAUAAAAAUAUU 


455 


AATATTTTTATATACCACAGC 


965 


CCACAGACCCCAACCCACAAG 


456 


CTTGTGGGTTGGGGTCTGTGG 


966 


GUUCUUGGGAGCAGCAGGAAG 


457 


CTTCCTGCTGCTCCCAAGAAC 


967 


ACCCACAGACCCCAACCCACA 


458 


TGTGGGTTGGGGTCTGTGGGT 


968 


CACAGACCCCAACCCACAAGA 


459 


TCTTGTGGGTTGGGGTCTGTG 


969 


CCUGUGCCUCUUCAGCUACCA 


460 


TGGTAGCTGAAGAGGCACAGG 


970 


AC AGAC C CCAACC CACAAGAA 


461 


TTCTTGTGGGTTGGGGTCTGT 


971 


GGUUCUUGGGAGC AG C AGGAA 


462 


TT C CTGCTG CT C C CAAGAACC 


972 


GCAACUCACAGUCUGGGGCAU 


463 


ATGCCCCAGACTGTGAGTTGC 


973 


UGCAACUCACAGUCUGGGGCA 


464 


TGCCCCAGACTGTGAGTTGCA 


974 


GGGUUCUUGGGAGC AGCAG GA 


465 


TCCTGCTGCTCCCAAGAACCC 


975 


UUGCAACUCACAGUCUGGGGC 


466 


GCCCCAGACTGTGAGTTGCAA 


976 


AUGAGGGACAAUUGGAGAAGU 


467 


ACTTCTCCAATTGTCCCTCAT 


977 


UGUUGCAACUCACAGUCUGGG 


468 


C CCAG ACTGTGAGTTG CAAC A 


978 


UGAGGGACAAUUGGAGAAGUG 


469 


CACTTCTCCAATTGTCCCTCA 


979 


UGAAUUAUAUAAAUAUAAAGU 


470 


ACTTTATATTTATATAATTCA 


980 


GUUGCAACUCACAGUCUGGGG 


471 


CCCCAGACTGTGAGTTGCAAC 


981 


UGGAGAAGUGAAUUAUAUAAA 


472 


TTTATATAATTCACTTCTCCA 


982 


UUGGGUUCUUGGGAGCAGCAG 


473 


CTGCTGCTCCCAAGAACCCAA 


983 


AAAGCCUAAAGCCAUGUGUAA 


474 


TTACACATGGCTTTAGGCTTT 


984 


UGGGUUCUUGGGAG C AG CAGG 


475 


CCTGCTGCTCCCAAGAACCCA 


985 


GGAGAAGUGAAUUAUAUAAAU 


476 


ATTTAT AT AATTCAC TT CTCC 


986 


GAGAAGUGAAUUAUAUAAAUA 


477 


T ATTTATATAATTC AC T TCTC 


987 


AGGGACAAUUGGAGAAGUGAA 


478 


TTCACTTCTCCAATTGTCCCT 


988 


AAGUGAAUUAUAUAAAUAUAA 


479 


TTATATTTATATAATTCACTT 


989 


GAGGGACAAUUGGAGAAGUGA 


480 


TCACTTCTCCAATTGTCCCTC 


990 


AUAUGAGGGACAAUUGGAGAA 


481 


TTCTCCAATTGTCCCTCATAT 


991 


AGAAGUGAAUUAUAUAAAUAU 


482 


ATATTTATATAATTCACTTCT 


992 


CUGUGUACCCACAGACCCCAA 


483 


TTGGGGTCTGTGGGTACACAG 


993 


GAAGUGAAUUAUAUAAAUAUA 


484 


TATATTTATATAATTCACTTC 


994 


UACAAUGUACACAUGGAAUUA 


485 


TAATTC CATGTGT ACATTGTA 


995 


CAGUACAAUGUACACAUGGAA 


486 


TTCCATGTGTACATTGTACTG 


996 


AAGC CUAAAGC C AUGUGU AAA 


487 


TTTACACATGGCTTTAGGCTT 


997 


CCUUGGGUUCUUGGGAGCAGC 


488 


GCTGCT CCCAAG AAC C C AAGG 


998 


AGUACAAUGUACACAUGGAAU 


489 


ATT C C ATGT GT AC ATTGT ACT 


999 


UCAAUAACGCUGACGGUACAG 


490 


CTGTACCGTCAGCGTTATTGA 


1000 


ACAUGUGGAAAAAUAACAUGG 


491 


C CAT GTTAT TTT T CCACATGT 


1001 


UCCUUGGGUUCUUGGGAGCAG 


492 


CTGCTCCCAAGAACCCAAGGA 


1002 


ACGGUACAGGCCAGACAAUUA 


493 


TAATTGTCTGGCCTGTACCGT 


1003 


GUACAAUGUACACAUGGAAUU 


494 


AATTCCATGTGTACATTGTAC 


1004 


UUGUCUGGUAUAGUGCAACAG 


495 


CTGTTGCACTATAC CAGACAA 


1005 


UAAUCAGUUUAUGGGAUCAAA 


496 


TTTGATCCCATAAACTGATTA 


1006 


AGUGAAUUAUAUAAAUAUAAA 


497 


TTTATATTTATATAATTCACT 


1007 


GG G ACAAUUGGAG AAGUG AAU 


I 498 


ATTCACTTCTCCAATTGTCCC 


1008 


UUAUGGGGUAC CUGUGUGG AA 


499 


TT C C ACAC AGGTAC CC C ATAA 


1009 


AAGCAAUGUAUGCCCCUCCCA 


500 


TGGGAGGGGCATACATTGCTT 


1010 
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AAUCAGUUUAUGGGAUCAAAG 


501 


CTTTGAT C CC ATAAACTGATT 


1011 


CAAUAACG CUGACGGUAC AGG 


502 


CCTGTACCGTCAGCGTTATTG 


1012 


GUGCCUCUUCAGCUACCACCG 


503 


CGGTGGTAGCTGAAGAGGCAC 


1013 


GAUAUAAUCAGUUUAUGGGAU 


1 504 


ATCCCATAAACTGATTATATC 


1014 
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1154 


1155 


vo 
in 

H 
rH 


1157 


1158 


1159 


1160 
1161 


CN 
VO 
H 

rH 


1163 


VD 
rH 
H 


m 

VD 
H 
H 


VD 
VD 
rH 
rH 


1167 


1168 


o\ 

VO 

rH 
rH 


1170 


1171 


1172 




IJL174J 




1117SJ 


1177 


1178 | 


1179 


1180 




1182 


1183 [ 


1184 \ 
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Table XII: HIV gp41 peptide sequences 



Peptide Sequence 


SEQ ID NO: 


YTSLfflSLffiESQNQQEKNEQELLELDKWASLWNWF 


1233 


NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ 


1234 
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CLAIMS 

What we claim is: 

1. A short interfering RNA (siRNA) molecule that down-regulates expression of a 
HIV envelope glycoprotein (env) gene by RNA interference. 

2. The siRNA molecule of claim 1, wherein said HIV envelope glycoprotein gene 
encodes sequence comprising Genbank Accession number NC_00 1802. 

3. The siRNA molecule of claim 1, wherein the siRNA molecule comprises 
sequence complementary to a nucleic acid sequence encoding HIV envelope 
glycoprotein or a portion thereof. 

4. The siRNA molecule of claim 1, wherein said siRNA molecule comprises about 
21 nucleotides. 

5. The siRNA molecule of claim 1, wherein said siRNA molecule is double 
stranded. 

6. The siRNA molecule of claim 5, wherein each strand of said siRNA molecule 
comprises about 21 nucleotides. 

7. The siRNA molecule of claim 1 , wherein said siRNA molecule has anti-fusogenic 
activity against HIV entry into a cell. 

8. The siRNA molecule of claim 1, wherein said siRNA molecule is chemically 
synthesized. 

9. The siRNA molecule of claim 1, wherein said siRNA molecule comprises at least 
one nucleic acid sugar modification. 

10. The siRNA molecule of claim 1, wherein said siRNA molecule comprises at least 
one nucleic acid base modification. 

1 1 . The siRNA molecule of claim 1 , wherein said siRNA molecule comprises at least 
one nucleic acid backbone modification. 

12. A method for modulating HIV cell fusion activity in a cell comprising 
administering to said cell the siRNA molecule of claim 1 under conditions 
suitable for modulating said HIV cell fusion activity. 
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13. The method of claim 12, wherein said cell is a mammalian cell. 

14. The method of claim 13, wherein said mammalian cell is a human cell. 

15. A method of treating HIV-1 infection in a subject comprising administering to the 
subject the siRNA of claim 1 under conditions suitable for said treatment. 

16. The method of claim 15, wherein said administration is in the presence of a 
delivery reagent. 

17. The method of claim 16, wherein said delivery reagent is a lipid. 

1 8. The method of claim 1 7, wherein said lipid is a cat ionic lipid. 

19. The method of claim 16, wherein said delivery reagent is a liposome. 

20. A composition comprising the siRNA of claim 1 and a pharmaceutical^ 
acceptable carrier or diluent. 
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